Skip site navigation (1)Skip section navigation (2)
Date:      Thu, 5 Sep 2002 15:33:55 -0700 (PDT)
From:      Paul English <penglish@hydro.washington.edu>
To:        freebsd-scsi@freebsd.org
Subject:   Arena IDE raid 99EX problem
Message-ID:  <20020905152315.K87374-100000@dynamic.hydro.washington.edu>

next in thread | raw e-mail | index | archive | help


Hi,
	I have an Arena IDE raid 99EX 6 bay desktop which was purchased
some time ago (approx 2 years +). It is operating in raid 5 mode with 6
maxtor 60 gb disks and is attached to a PC running FreeBSD
(4.4-RELEASE) with an Adaptec SCSI (7880) card.

I am seeing a problem when I access (attempt to copy, or just get
file properties) specific files in a specific directory - massive SCSI errors
are generated and the filesystem becomes inaccessible. Only a reboot
(and complete fsck) makes the filesystem available again. Since this is
happening below the filesystem level, I don't suspect that it is just a
filesystem error. I suspect that either we've encountered a bug in the
firmware code, or one (or more) of the disks has bad blocks, and it has not
been detected as a bad disk.

I have tried:
shutting down and restarting the host system and the raid array
swapping SCSI cables
moving the array to another system (also running 4.4-RELEASE)

My only remaining option seems to be copying all of the data off (except
that directory) and hoping that I don't discover other files with the same
problem in the process. Then I would reinitialize the array and copy it
all back. Perhaps as part of that process I should test the individual
disks in a machine to see if I can isolate a responsible one.

I don't have the know-how to tell what the SCSI errors are
indicating, but following are the SCSI errors generated (I included
only the first couple of what appear to be repetitions):

Sep  4 16:28:27 ephemeral /kernel: (da1:ahc0:0:10:0): SCB 0x42 - timed out
Sep  4 16:28:27 ephemeral /kernel: ahc0: Dumping Card State while idle, at
SEQAD
DR 0x8
Sep  4 16:28:27 ephemeral /kernel: ACCUM = 0x0, SINDEX = 0x1c, DINDEX =
0xe4, AR
G_2 = 0x0
Sep  4 16:28:27 ephemeral /kernel: HCNT = 0x0
Sep  4 16:28:27 ephemeral /kernel: SCSISEQ = 0x12, SBLKCTL = 0xa
Sep  4 16:28:27 ephemeral /kernel: DFCNTRL = 0x0, DFSTATUS = 0x89
Sep  4 16:28:27 ephemeral /kernel: LASTPHASE = 0x1, SCSISIGI = 0x0,
SXFRCTL0 = 0
x80
Sep  4 16:28:27 ephemeral /kernel: SSTAT0 = 0x0, SSTAT1 = 0xa
Sep  4 16:28:27 ephemeral /kernel: STACK == 0x3, 0x10d, 0x165, 0x0
Sep  4 16:28:27 ephemeral /kernel: SCB count = 80
Sep  4 16:28:27 ephemeral /kernel: Kernel NEXTQSCB = 42
Sep  4 16:28:27 ephemeral /kernel: Card NEXTQSCB = 42
Sep  4 16:28:27 ephemeral /kernel: QINFIFO entries:
Sep  4 16:28:27 ephemeral /kernel: Waiting Queue entries:
Sep  4 16:28:27 ephemeral /kernel: Disconnected Queue entries: 7:26 26:66
Sep  4 16:28:27 ephemeral /kernel: QOUTFIFO entries:
Sep  4 16:28:27 ephemeral /kernel: Sequencer Free SCB List: 1 16 0 17 8 25
15 14
 9 28 20 13 21 6 5 22 19 2 18 23 24 12 27 4 3 11 10 29 30 31
Sep  4 16:28:27 ephemeral /kernel: Pending list: 26, 66
Sep  4 16:28:27 ephemeral /kernel: Kernel Free SCB list: 28 40 35 65 29 34
37 58
 4 31 6 68 11 33 47 1 20 2 14 10 8 63 39 44 30 0 53 23 67 56 24 22 17 52
62 41 6
1 43 64 9 3 57 32 12 49 51 21 45 59 79 15 16 48 36 69 60 5 7 25 19 13 54
46 55 3
8 18 27 50 78 77 76 75 74 73 72 71 70
Sep  4 16:28:27 ephemeral /kernel: sg[0] - Addr 0xbd00000 : Length 4096
Sep  4 16:28:27 ephemeral /kernel: sg[1] - Addr 0x2ca1000 : Length 4096
Sep  4 16:28:27 ephemeral /kernel: sg[2] - Addr 0x1d82000 : Length 4096
Sep  4 16:28:27 ephemeral /kernel: sg[3] - Addr 0x6d03000 : Length 4096
Sep  4 16:28:27 ephemeral /kernel: sg[4] - Addr 0x2a84000 : Length 4096
Sep  4 16:28:27 ephemeral /kernel: sg[5] - Addr 0x2845000 : Length 4096
Sep  4 16:28:27 ephemeral /kernel: sg[6] - Addr 0x2046000 : Length 4096
Sep  4 16:28:27 ephemeral /kernel: sg[7] - Addr 0x1dce7000 : Length 4096
Sep  4 16:28:27 ephemeral /kernel: sg[8] - Addr 0x3148000 : Length 4096
Sep  4 16:28:27 ephemeral /kernel: sg[9] - Addr 0x42c9000 : Length 4096
Sep  4 16:28:27 ephemeral /kernel: sg[10] - Addr 0x1b9ca000 : Length 4096
Sep  4 16:28:27 ephemeral /kernel: sg[11] - Addr 0x1b80b000 : Length 4096
Sep  4 16:28:27 ephemeral /kernel: sg[12] - Addr 0x1bb8c000 : Length 4096
Sep  4 16:28:27 ephemeral /kernel: sg[13] - Addr 0x1b5ad000 : Length 4096
Sep  4 16:28:27 ephemeral /kernel: sg[14] - Addr 0x1e86e000 : Length 4096
Sep  4 16:28:27 ephemeral /kernel: (da1:ahc0:0:10:0): Queuing a BDR SCB
Sep  4 16:28:27 ephemeral /kernel: (da1:ahc0:0:10:0): Bus Device Reset
Message Sent
Sep  4 16:28:27 ephemeral /kernel: (da1:ahc0:0:10:0): no longer in
timeout, stat
us = 34b
Sep  4 16:28:27 ephemeral /kernel: ahc0: Bus Device Reset on A:10. 2 SCBs
aborte
d
Sep  4 16:29:42 ephemeral /kernel: (da1:ahc0:0:10:0): SCB 0x23 - timed out
Sep  4 16:29:42 ephemeral /kernel: ahc0: Dumping Card State while idle, at
SEQAD
DR 0x8
Sep  4 16:29:42 ephemeral /kernel: ACCUM = 0x0, SINDEX = 0x1c, DINDEX =
0xe4, AR
G_2 = 0x0
Sep  4 16:29:42 ephemeral /kernel: HCNT = 0x0
Sep  4 16:29:42 ephemeral /kernel: SCSISEQ = 0x12, SBLKCTL = 0xa
Sep  4 16:29:42 ephemeral /kernel: DFCNTRL = 0x0, DFSTATUS = 0x89
Sep  4 16:29:42 ephemeral /kernel: LASTPHASE = 0x1, SCSISIGI = 0x0,
SXFRCTL0 = 0
x80
Sep  4 16:29:42 ephemeral /kernel: SSTAT0 = 0x0, SSTAT1 = 0xa
Sep  4 16:29:42 ephemeral /kernel: STACK == 0x3, 0x10d, 0x165, 0xec
Sep  4 16:29:42 ephemeral /kernel: SCB count = 80
Sep  4 16:29:42 ephemeral /kernel: Kernel NEXTQSCB = 65
Sep  4 16:29:42 ephemeral /kernel: Card NEXTQSCB = 65
Sep  4 16:29:42 ephemeral /kernel: QINFIFO entries:
Sep  4 16:29:42 ephemeral /kernel: Waiting Queue entries:
Sep  4 16:29:42 ephemeral /kernel: Disconnected Queue entries: 1:26 15:35
Sep  4 16:29:42 ephemeral /kernel: QOUTFIFO entries:
Sep  4 16:29:42 ephemeral /kernel: Sequencer Free SCB List: 17 9 25 0 14
26 8 16
 7 28 20 13 21 6 5 22 19 2 18 23 24 12 27 4 3 11 10 29 30 31
Sep  4 16:29:42 ephemeral /kernel: Pending list: 26, 35
Sep  4 16:29:42 ephemeral /kernel: Kernel Free SCB list: 28 58 37 4 42 40
34 29
66 31 6 68 11 33 47 1 20 2 14 10 8 63 39 44 30 0 53 23 67 56 24 22 17 52
62 41 6
1 43 64 9 3 57 32 12 49 51 21 45 59 79 15 16 48 36 69 60 5 7 25 19 13 54
46 55 3
8 18 27 50 78 77 76 75 74 73 72 71 70
Sep  4 16:29:42 ephemeral /kernel: sg[0] - Addr 0xbd00000 : Length 4096
Sep  4 16:29:42 ephemeral /kernel: sg[1] - Addr 0x2ca1000 : Length 4096
Sep  4 16:29:42 ephemeral /kernel: sg[2] - Addr 0x1d82000 : Length 4096
Sep  4 16:29:42 ephemeral /kernel: sg[3] - Addr 0x6d03000 : Length 4096
Sep  4 16:29:42 ephemeral /kernel: sg[4] - Addr 0x2a84000 : Length 4096
Sep  4 16:29:42 ephemeral /kernel: sg[5] - Addr 0x2845000 : Length 4096
Sep  4 16:29:42 ephemeral /kernel: sg[6] - Addr 0x2046000 : Length 4096
Sep  4 16:29:42 ephemeral /kernel: sg[7] - Addr 0x1dce7000 : Length 4096
Sep  4 16:29:42 ephemeral /kernel: sg[8] - Addr 0x3148000 : Length 4096
Sep  4 16:29:42 ephemeral /kernel: sg[9] - Addr 0x42c9000 : Length 4096
Sep  4 16:29:42 ephemeral /kernel: sg[10] - Addr 0x1b9ca000 : Length 4096
Sep  4 16:29:42 ephemeral /kernel: sg[11] - Addr 0x1b80b000 : Length 4096
Sep  4 16:29:42 ephemeral /kernel: sg[12] - Addr 0x1bb8c000 : Length 4096
Sep  4 16:29:42 ephemeral /kernel: sg[13] - Addr 0x1b5ad000 : Length 4096
Sep  4 16:29:42 ephemeral /kernel: sg[14] - Addr 0x1e86e000 : Length 4096
Sep  4 16:29:42 ephemeral /kernel: (da1:ahc0:0:10:0): Queuing a BDR SCB
Sep  4 16:29:42 ephemeral /kernel: (da1:ahc0:0:10:0): Bus Device Reset
Message S
ent
Sep  4 16:29:42 ephemeral /kernel: (da1:ahc0:0:10:0): no longer in
timeout, stat
us = 34b
Sep  4 16:29:42 ephemeral /kernel: ahc0: Bus Device Reset on A:10. 2 SCBs
aborte
d
Sep  4 16:30:57 ephemeral /kernel: (da1:ahc0:0:10:0): SCB 0x41 - timed out
Sep  4 16:30:57 ephemeral /kernel: ahc0: Dumping Card State while idle, at
SEQAD
DR 0x9
Sep  4 16:30:57 ephemeral /kernel: ACCUM = 0x0, SINDEX = 0x1c, DINDEX =
0xe4, AR
G_2 = 0x0
Sep  4 16:30:57 ephemeral /kernel: HCNT = 0x0
Sep  4 16:30:57 ephemeral /kernel: SCSISEQ = 0x12, SBLKCTL = 0xa
Sep  4 16:30:57 ephemeral /kernel: DFCNTRL = 0x0, DFSTATUS = 0x89
Sep  4 16:30:57 ephemeral /kernel: LASTPHASE = 0x1, SCSISIGI = 0x0,
SXFRCTL0 = 0
x80
Sep  4 16:30:57 ephemeral /kernel: SSTAT0 = 0x0, SSTAT1 = 0xa
Sep  4 16:30:57 ephemeral /kernel: STACK == 0x3, 0x10d, 0x165, 0xec
Sep  4 16:30:57 ephemeral /kernel: SCB count = 80
Sep  4 16:30:57 ephemeral /kernel: Kernel NEXTQSCB = 4
Sep  4 16:30:57 ephemeral /kernel: Card NEXTQSCB = 4
Sep  4 16:30:57 ephemeral /kernel: QINFIFO entries:
Sep  4 16:30:57 ephemeral /kernel: Waiting Queue entries:
Sep  4 16:30:57 ephemeral /kernel: Disconnected Queue entries: 1:35 15:65
Sep  4 16:30:57 ephemeral /kernel: QOUTFIFO entries:
Sep  4 16:30:57 ephemeral /kernel: Sequencer Free SCB List: 9 0 25 17 14
26 8 16
 7 28 20 13 21 6 5 22 19 2 18 23 24 12 27 4 3 11 10 29 30 31
Sep  4 16:30:57 ephemeral /kernel: Pending list: 35, 65
Sep  4 16:30:57 ephemeral /kernel: Kernel Free SCB list: 28 40 42 29 34 26
37 58
 66 31 6 68 11 33 47 1 20 2 14 10 8 63 39 44 30 0 53 23 67 56 24 22 17 52
62 41
61 43 64 9 3 57 32 12 49 51 21 45 59 79 15 16 48 36 69 60 5 7 25 19 13 54
46 55
38 18 27 50 78 77 76 75 74 73 72 71 70
Sep  4 16:30:57 ephemeral /kernel: sg[0] - Addr 0xbd00000 : Length 4096
Sep  4 16:30:57 ephemeral /kernel: sg[1] - Addr 0x2ca1000 : Length 4096
Sep  4 16:30:57 ephemeral /kernel: sg[2] - Addr 0x1d82000 : Length 4096
Sep  4 16:30:57 ephemeral /kernel: sg[3] - Addr 0x6d03000 : Length 4096
Sep  4 16:30:57 ephemeral /kernel: sg[4] - Addr 0x2a84000 : Length 4096
Sep  4 16:30:57 ephemeral /kernel: sg[5] - Addr 0x2845000 : Length 4096
Sep  4 16:30:57 ephemeral /kernel: sg[6] - Addr 0x2046000 : Length 4096
Sep  4 16:30:57 ephemeral /kernel: sg[7] - Addr 0x1dce7000 : Length 4096
Sep  4 16:30:57 ephemeral /kernel: sg[8] - Addr 0x3148000 : Length 4096
Sep  4 16:30:57 ephemeral /kernel: sg[9] - Addr 0x42c9000 : Length 4096
Sep  4 16:30:57 ephemeral /kernel: sg[10] - Addr 0x1b9ca000 : Length 4096
Sep  4 16:30:57 ephemeral /kernel: sg[11] - Addr 0x1b80b000 : Length 4096
Sep  4 16:30:57 ephemeral /kernel: sg[12] - Addr 0x1bb8c000 : Length 4096
Sep  4 16:30:57 ephemeral /kernel: sg[13] - Addr 0x1b5ad000 : Length 4096
Sep  4 16:30:57 ephemeral /kernel: sg[14] - Addr 0x1e86e000 : Length 4096
Sep  4 16:30:57 ephemeral /kernel: (da1:ahc0:0:10:0): Queuing a BDR SCB
Sep  4 16:30:57 ephemeral /kernel: (da1:ahc0:0:10:0): Bus Device Reset
Message S
ent
Sep  4 16:30:57 ephemeral /kernel: (da1:ahc0:0:10:0): no longer in
timeout, stat
us = 34b
Sep  4 16:30:57 ephemeral /kernel: ahc0: Bus Device Reset on A:10. 2 SCBs
aborte
d
Sep  4 16:32:12 ephemeral /kernel: (da1:ahc0:0:10:0): SCB 0x22 - timed out
Sep  4 16:32:12 ephemeral /kernel: ahc0: Dumping Card State while idle, at
SEQAD
DR 0x8
Sep  4 16:32:12 ephemeral /kernel: ACCUM = 0x0, SINDEX = 0x6, DINDEX =
0xe4, ARG
_2 = 0x0
Sep  4 16:32:12 ephemeral /kernel: HCNT = 0x0
Sep  4 16:32:12 ephemeral /kernel: SCSISEQ = 0x12, SBLKCTL = 0xa
Sep  4 16:32:12 ephemeral /kernel: DFCNTRL = 0x0, DFSTATUS = 0x89
Sep  4 16:32:12 ephemeral /kernel: LASTPHASE = 0x1, SCSISIGI = 0x0,
SXFRCTL0 = 0
x80
Sep  4 16:32:12 ephemeral /kernel: SSTAT0 = 0x0, SSTAT1 = 0xa
Sep  4 16:32:12 ephemeral /kernel: STACK == 0x3, 0x10d, 0x165, 0xec
Sep  4 16:32:12 ephemeral /kernel: SCB count = 80
Sep  4 16:32:12 ephemeral /kernel: Kernel NEXTQSCB = 42
Sep  4 16:32:12 ephemeral /kernel: QINFIFO entries:
Sep  4 16:32:12 ephemeral /kernel: Waiting Queue entries:
Sep  4 16:32:12 ephemeral /kernel: Disconnected Queue entries: 1:26 9:34
Sep  4 16:32:12 ephemeral /kernel: QOUTFIFO entries:
Sep  4 16:32:12 ephemeral /kernel: Sequencer Free SCB List: 0 26 17 15 25
7 16 8
 14 28 20 13 21 6 5 22 19 2 18 23 24 12 27 4 3 11 10 29 30 31
Sep  4 16:32:12 ephemeral /kernel: Pending list: 26, 34
Sep  4 16:32:12 ephemeral /kernel: Kernel Free SCB list: 6 65 47 58 29 33
11 68
40 4 35 28 37 66 31 1 20 2 14 10 8 63 39 44 30 0 53 23 67 56 24 22 17 52
62 41 6
1 43 64 9 3 57 32 12 49 51 21 45 59 79 15 16 48 36 69 60 5 7 25 19 13 54
46 55 3
8 18 27 50 78 77 76 75 74 73 72 71 70
Sep  4 16:32:12 ephemeral /kernel: sg[0] - Addr 0xbd00000 : Length 4096
Sep  4 16:32:12 ephemeral /kernel: sg[1] - Addr 0x2ca1000 : Length 4096
Sep  4 16:32:12 ephemeral /kernel: sg[2] - Addr 0x1d82000 : Length 4096
Sep  4 16:32:12 ephemeral /kernel: sg[3] - Addr 0x6d03000 : Length 4096
Sep  4 16:32:12 ephemeral /kernel: sg[4] - Addr 0x2a84000 : Length 4096
Sep  4 16:32:12 ephemeral /kernel: sg[5] - Addr 0x2845000 : Length 4096
Sep  4 16:32:12 ephemeral /kernel: sg[6] - Addr 0x2046000 : Length 4096
Sep  4 16:32:12 ephemeral /kernel: sg[7] - Addr 0x1dce7000 : Length 4096
Sep  4 16:32:12 ephemeral /kernel: sg[8] - Addr 0x3148000 : Length 4096
Sep  4 16:32:12 ephemeral /kernel: sg[9] - Addr 0x42c9000 : Length 4096
Sep  4 16:32:12 ephemeral /kernel: sg[10] - Addr 0x1b9ca000 : Length 4096
Sep  4 16:32:12 ephemeral /kernel: sg[11] - Addr 0x1b80b000 : Length 4096
Sep  4 16:32:12 ephemeral /kernel: sg[12] - Addr 0x1bb8c000 : Length 4096
Sep  4 16:32:12 ephemeral /kernel: sg[13] - Addr 0x1b5ad000 : Length 4096
Sep  4 16:32:12 ephemeral /kernel: sg[14] - Addr 0x1e86e000 : Length 4096
Sep  4 16:32:12 ephemeral /kernel: (da1:ahc0:0:10:0): Queuing a BDR SCB
Sep  4 16:32:12 ephemeral /kernel: (da1:ahc0:0:10:0): Bus Device Reset
Message S
ent
Sep  4 16:32:12 ephemeral /kernel: (da1:ahc0:0:10:0): no longer in
timeout, stat
us = 34b
Sep  4 16:32:12 ephemeral /kernel: ahc0: Bus Device Reset on A:10. 2 SCBs
aborte
d
Sep  4 16:33:27 ephemeral /kernel: (da1:ahc0:0:10:0): SCB 0x2a - timed out
Sep  4 16:33:27 ephemeral /kernel: ahc0: Dumping Card State while idle, at
SEQAD
DR 0x9
Sep  4 16:33:27 ephemeral /kernel: ACCUM = 0x0, SINDEX = 0x41, DINDEX =
0xe4, AR
G_2 = 0x0
Sep  4 16:33:27 ephemeral /kernel: HCNT = 0x0
Sep  4 16:33:27 ephemeral /kernel: SCSISEQ = 0x12, SBLKCTL = 0xa
Sep  4 16:33:27 ephemeral /kernel: DFCNTRL = 0x0, DFSTATUS = 0x89
Sep  4 16:33:27 ephemeral /kernel: LASTPHASE = 0x1, SCSISIGI = 0x0,
SXFRCTL0 = 0
x80
Sep  4 16:33:27 ephemeral /kernel: SSTAT0 = 0x0, SSTAT1 = 0xa
Sep  4 16:33:27 ephemeral /kernel: STACK == 0x3, 0x10d, 0x165, 0xec
Sep  4 16:33:27 ephemeral /kernel: SCB count = 80
Sep  4 16:33:27 ephemeral /kernel: Kernel NEXTQSCB = 68
Sep  4 16:33:27 ephemeral /kernel: Card NEXTQSCB = 68
Sep  4 16:33:27 ephemeral /kernel: QINFIFO entries:
Sep  4 16:33:27 ephemeral /kernel: Waiting Queue entries:
Sep  4 16:33:27 ephemeral /kernel: Disconnected Queue entries: 1:34 9:42
Sep  4 16:33:27 ephemeral /kernel: QOUTFIFO entries:
Sep  4 16:33:27 ephemeral /kernel: Sequencer Free SCB List: 17 14 8 7 15
26 16 2
5 0 28 20 13 21 6 5 22 19 2 18 23 24 12 27 4 3 11 10 29 30 31
Sep  4 16:33:27 ephemeral /kernel: Pending list: 34, 42
Sep  4 16:33:27 ephemeral /kernel: Kernel Free SCB list: 65 40 47 29 11 6
33 58
26 4 35 28 37 66 31 1 20 2 14 10 8 63 39 44 30 0 53 23 67 56 24 22 17 52
62 41 6
1 43 64 9 3 57 32 12 49 51 21 45 59 79 15 16 48 36 69 60 5 7 25 19 13 54
46 55 3
8 18 27 50 78 77 76 75 74 73 72 71 70
Sep  4 16:33:27 ephemeral /kernel: sg[0] - Addr 0xbd00000 : Length 4096
Sep  4 16:33:27 ephemeral /kernel: sg[1] - Addr 0x2ca1000 : Length 4096
Sep  4 16:33:27 ephemeral /kernel: sg[2] - Addr 0x1d82000 : Length 4096
Sep  4 16:33:27 ephemeral /kernel: sg[3] - Addr 0x6d03000 : Length 4096
Sep  4 16:33:27 ephemeral /kernel: sg[4] - Addr 0x2a84000 : Length 4096
Sep  4 16:33:27 ephemeral /kernel: sg[5] - Addr 0x2845000 : Length 4096
Sep  4 16:33:27 ephemeral /kernel: sg[6] - Addr 0x2046000 : Length 4096
Sep  4 16:33:27 ephemeral /kernel: sg[7] - Addr 0x1dce7000 : Length 4096
Sep  4 16:33:27 ephemeral /kernel: sg[8] - Addr 0x3148000 : Length 4096



Thanks in advance for any help,
Paul


To Unsubscribe: send mail to majordomo@FreeBSD.org
with "unsubscribe freebsd-scsi" in the body of the message




Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20020905152315.K87374-100000>