From: Lukas Ertl
Date: Sun, 15 Sep 2002 02:25:06 +0200 (CEST)
To: freebsd-scsi@freebsd.org
Subject: ahc error messages

Hi,

Tonight one of my servers crashed. An external RAID-5 box is connected to the server via an Adaptec 3960D SCSI adapter. This RAID box should be fully OS transparent and look like one big disk to the system (so the whole RAID magic is hidden away from the operating system).

Well, one of the disks in this RAID box died. That shouldn't be a problem: the hot spare disk jumped in. (We tested that when we first installed this box: pulling disks out and putting them back in while the system was online worked.)

Nevertheless, the server crashed. Although I think that it's the fault of the RAID box (and the so-called "transparency" is just a marketing gag), I hope that someone can explain the errors I found in /var/log/messages.

It started with:

---8<---
Sep 15 00:13:11 ftp /kernel: (da1:ahc1:0:2:0): SCB 0x6a - timed out
Sep 15 00:13:11 ftp /kernel: ahc1: Dumping Card State while idle, at SEQADDR 0x9
Sep 15 00:13:11 ftp /kernel: ACCUM = 0x0, SINDEX = 0x7e, DINDEX = 0xe4, ARG_2 = 0x0
Sep 15 00:13:11 ftp /kernel: HCNT = 0x0 SCBPTR = 0x3
Sep 15 00:13:11 ftp /kernel: SCSISEQ = 0x12, SBLKCTL = 0xa
Sep 15 00:13:11 ftp /kernel: DFCNTRL = 0x0, DFSTATUS = 0x89
Sep 15 00:13:11 ftp /kernel: LASTPHASE = 0x1, SCSISIGI = 0x0, SXFRCTL0 = 0x80
Sep 15 00:13:11 ftp /kernel: SSTAT0 = 0x0, SSTAT1 = 0x8
Sep 15 00:13:11 ftp /kernel: SCSIPHASE = 0x0
Sep 15 00:13:11 ftp /kernel: STACK == 0x3, 0x108, 0x160, 0x0
Sep 15 00:13:11 ftp /kernel: SCB count = 254
Sep 15 00:13:11 ftp /kernel: Kernel NEXTQSCB = 29
Sep 15 00:13:12 ftp /kernel: Card NEXTQSCB = 29
Sep 15 00:13:12 ftp /kernel: QINFIFO entries:
Sep 15 00:13:12 ftp /kernel: Waiting Queue entries:
Sep 15 00:13:12 ftp /kernel: Disconnected Queue entries: 31:147 30:79 29:19 28:162 27:33 26:153 25:156 8:129 7:208 2:70 13:154 18:54 1:136 17:181 19:51 21:190 9:170 14:178 10:86 22:232 24:185 4:58 20:69 6:16 0:5 15:116 16:118 11:234 5:199 12:121 23:106
Sep 15 00:13:12 ftp /kernel: QOUTFIFO entries:
Sep 15 00:13:12 ftp /kernel: Sequencer Free SCB List: 3
Sep 15 00:13:12 ftp /kernel: Sequencer SCB Info: 0(c 0x64, s 0x27, l 0, t 0x5) 1(c 0x64, s 0x27, l 0, t 0x88) 2(c 0x64, s 0x27, l 0, t 0x46) 3(c 0x60, s 0x27, l 0, t 0xff) 4(c 0x64, s 0x27, l 0, t 0x3a) 5(c 0x64, s 0x27, l 0, t 0xc7) 6(c 0x64, s 0x27, l 0, t 0x10) 7(c 0x64, s 0x27, l 0, t 0xd0) 8(c 0x64, s 0x27, l 0, t 0x81) 9(c 0x64, s 0x27, l 0, t 0xaa) 10(c 0x64, s 0x27, l 0, t 0x56) 11(c 0x64, s 0x27, l 0, t 0xea) 12(c 0x64, s 0x27, l 0, t 0x79) 13(c 0x64, s 0x27, l 0, t 0x9a) 14(c 0x64, s 0x27, l 0, t 0xb2) 15(c 0x64, s 0x27, l 0, t 0x74) 16(c 0x64, s 0x27, l 0, t 0x76) 17(c 0x64, s 0x27, l 0, t 0xb5) 18(c 0x64, s 0x27, l 0, t 0x36) 19(c 0x64, s 0x27, l 0, t 0x33) 20(c 0x64, s 0x27, l 0, t 0x45) 21(c 0x64, s 0x27, l 0, t 0xbe) 22(c 0x64, s 0x27, l 0, t 0xe8) 23(c 0x64, s 0x27, l 0, t 0x6a) 24(c 0x64, s 0x27, l 0, t 0xb9) 25(c 0x64, s 0x27, l 0, t 0x9c) 26(c 0x64, s 0x27, l 0, t 0x99) 27(c 0x64, s 0x27, l 0, t 0x21) 28(c 0x64, s 0x27, l 0, t 0xa2) 29(c 0x64, s 0x27, l 0, t 0x13) 30(c 0x64, s 0x27, l 0, t 0x4f) 31(c 0x64, s 0, t 0x93)
Sep 15 00:13:12 ftp /kernel: Pending list: 147(c 0x60, s 0x27, l 0), 7(c 0x64, s 0x27, l 0), 18(c 0x64, s 0x27, l 0), 90(c 0x64, s 0x27, l 0), 230(c 0x64, s 0x27, l 0), 248(c 0x64, s 0x27, l 0), 79(c 0x60, s 0x27, l 0), 19(c 0x60, s 0x27, l 0), 162(c 0x60, s 0x27, l 0), 33(c 0x60, s 0x27, l 0), 153(c 0x60, s 0x27, l 0), 156(c 0x60, s 0x27, l 0), 129(c 0x60, s 0x27, l 0), 208(c 0x60, s 0x27, l 0), 70(c 0x60, s 0x27, l 0), 154(c 0x60, s 0x27, l 0), 54(c 0x60, s 0x27, l 0), 136(c 0x60, s 0x27, l 0), 181(c 0x60, s 0x27, l 0), 51(c 0x60, s 0x27, l 0), 190(c 0x60, s 0x27, l 0), 170(c 0x60, s 0x27, l 0), 178(c 0x60, s 0x27, l 0), 86(c 0x60, s 0x27, l 0), 232(c 0x60, s 0x27, l 0), 185(c 0x60, s 0x27, l 0), 58(c 0x60, s 0x27, l 0), 69(c 0x60, s 0x27, l 0), 16(c 0x60, s 0x27, l 0), 5(c 0x60, s 0x27, l 0), 116(c 0x60, s 0x27, l 0), 118(c 0x60, s 0x27, l 0), 234(c 0x60, s 0x27, l 0), 199(c 0x60, s 0x27, l 0), 121(c 0x60, s 0x27, l 0), 106(c 0x60, s 0x27, l 0)
Sep 15 00:13:12 ftp /kernel: Kernel Free SCB list: 126 68 81 180 1 164 224 48 52 231 242 161 177 247 131 189 188 84 105 71 74 191 138 175 205 82 0 142 216 169 166 251 204 63 3 2 119 17 102 61 246 89 108 252 87 44 209 145 123 36 24 22 73 65 141 6 98 21 88 99 245 222 10 64 75 97 67 155 35 132 160 201 219 14 172 23 50 115 214 212 127 76 113 195 135 159 207 107 182 236 104 31 146 143 221 237 241 103 183 80 173 217 40 196 114 144 197 112 150 187 38 253 96 238 125 184 151 233 192 93 229 148 27 200 57 128 53 239 243 26 213 171 133 206 186 137 235 163 100 9 211 41 223 66 11 101 117 39 193 92 32 168 249 12 225 59 62 122 130 83 56 20 47 124 220 72 13 140 134 55 244 49 110 109 250 4 176 78 46 203 111 167 165 240 95 149 218 210 174 77 228 215 60 158 30 120 43 37 152 227 202 157 226 85 91 194 139 179 94 15 45 8 34 25 198 42 28
Sep 15 00:13:12 ftp /kernel: sg[0] - Addr 0x4dc09000 : Length 4096
Sep 15 00:13:12 ftp /kernel: sg[1] - Addr 0x32530000 : Length 4096
Sep 15 00:13:12 ftp /kernel: sg[2] - Addr 0x49611000 : Length 4096
Sep 15 00:13:12 ftp /kernel: sg[3] - Addr 0x21ff2000 : Length 4096
Sep 15 00:13:12 ftp /kernel: sg[4] - Addr 0x993000 : Length 4096
Sep 15 00:13:12 ftp /kernel: sg[5] - Addr 0x13b94000 : Length 4096
Sep 15 00:13:12 ftp /kernel: sg[6] - Addr 0xcd95000 : Length 4096
Sep 15 00:13:12 ftp /kernel: sg[7] - Addr 0x411f6000 : Length 4096
Sep 15 00:13:12 ftp /kernel: sg[8] - Addr 0x43e97000 : Length 4096
Sep 15 00:13:12 ftp /kernel: sg[9] - Addr 0x193f8000 : Length 4096
Sep 15 00:13:12 ftp /kernel: sg[10] - Addr 0x7199000 : Length 4096
Sep 15 00:13:12 ftp /kernel: sg[11] - Addr 0x47cba000 : Length 4096
Sep 15 00:14:27 ftp /kernel: sg[12] - Addr 0xcb1b000 : Length 4096
Sep 15 00:14:27 ftp /kernel: sg[13] - Addr 0x152dc000 : Length 4096
Sep 15 00:14:27 ftp /kernel: (da1:ahc1:0:2:0): Queuing a BDR SCB
Sep 15 00:14:27 ftp /kernel: (da1:ahc1:0:2:0): Bus Device Reset Message Sent
Sep 15 00:14:27 ftp /kernel: (da1:ahc1:0:2:0): no longer in timeout, status = 34b
Sep 15 00:14:27 ftp /kernel: ahc1: Bus Device Reset on A:2. 87 SCBs aborted
---8<---

Later came, repeatedly:

---8<---
Sep 15 00:15:32 ftp /kernel: CSIRATE(0x0)
Sep 15 00:15:32 ftp /kernel: Unexpected non-DT Data Phase
Sep 15 00:15:32 ftp /kernel: ahc1:A:2: parity error detected in DT Data-in phase. SEQADDR(0x1a3) SCSIRATE(0x0)
---8<---

And so on. Any hints and tips are appreciated. Please keep me CC'ed.

best regards,
le

--
Lukas Ertl                             eMail: l.ertl@univie.ac.at
UNIX-Systemadministrator               Tel.:  (+43 1) 4277-14073
Zentraler Informatikdienst (ZID)       Fax.:  (+43 1) 4277-9140
der Universität Wien                   http://mailbox.univie.ac.at/~le/
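
P.S. In case it helps anyone reading the dump: below is a minimal Python sketch for cross-checking two parts of it. It assumes (this is only a reading of the output, not something confirmed by the driver documentation) that each "Disconnected Queue" pair is "hardware SCB slot:tag" in decimal, and that the "t" field in "Sequencer SCB Info" is the same tag in hex. The two excerpt strings are copied from the log above; the function names are made up for illustration.

```python
import re

# Short excerpts copied from the card state dump above.
DISCONNECTED = "12:121 23:106 17:181 19:51 11:234"
SEQUENCER_INFO = (
    "11(c 0x64, s 0x27, l 0, t 0xea) "
    "12(c 0x64, s 0x27, l 0, t 0x79) "
    "17(c 0x64, s 0x27, l 0, t 0xb5) "
    "19(c 0x64, s 0x27, l 0, t 0x33) "
    "23(c 0x64, s 0x27, l 0, t 0x6a)"
)


def parse_disconnected(text):
    """Return {slot: tag} from 'slot:tag' pairs (assumed decimal)."""
    return {int(slot): int(tag) for slot, tag in re.findall(r"(\d+):(\d+)", text)}


def parse_sequencer_info(text):
    """Return {slot: tag} from 'slot(c 0x.., s 0x.., l n, t 0x..)' entries."""
    pattern = r"(\d+)\(c 0x[0-9a-f]+, s 0x[0-9a-f]+, l \d+, t 0x([0-9a-f]+)\)"
    return {int(slot): int(tag, 16) for slot, tag in re.findall(pattern, text)}


def slot_for_tag(slots, tag):
    """Return the first slot holding the given tag, or None if absent."""
    for slot, slot_tag in slots.items():
        if slot_tag == tag:
            return slot
    return None


disc = parse_disconnected(DISCONNECTED)
seq = parse_sequencer_info(SEQUENCER_INFO)

# Slots where the two sections of the dump disagree about the tag.
mismatches = {s: (t, seq.get(s)) for s, t in disc.items() if seq.get(s) != t}

# The timed-out SCB was reported as 0x6a; look up which slot carries it.
timed_out_slot = slot_for_tag(seq, 0x6A)

print("mismatches:", mismatches)
print("slot holding tag 0x6a:", timed_out_slot)
```

Under those assumptions the two sections agree for these entries, and the timed-out SCB 0x6a sits in slot 23, which is also the last entry in the disconnected queue.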