Date:      Tue, 25 Aug 1998 11:04:40 -0700 (PDT)
From:      David Wolfskill <dhw@whistle.com>
To:        freebsd-hardware@FreeBSD.ORG
Subject:   SCSI bus timeouts
Message-ID:  <199808251804.LAA23506@pau-amma.whistle.com>

I have a system (that's used as a server) with one each narrow (Adaptec
2940U; ahc0) and wide (Adaptec 2940UW; ahc1) SCSI host adapters.

Each bus has an assortment of disk & tape devices, but the narrow bus
has only narrow devices, and the wide bus has only wide devices.  (Well,
actually, the DLT drive & changer is a single external package, with a
pair of 68-pin wide SCSI connectors, but it's my understanding that the
changer is actually a narrow device.)  Anyway, in each case, there are
devices on the bus inside the case and devices on the bus external to
the case, and I've turned off all termination on the host adapters.

I'm trying to use amanda to perform backups with the (recently acquired)
DLT drive & changer, and I see SCSI bus timeouts & resets often enough
that it's Really Annoying.

I'll append an example set of entries after my .sig (to avoid disrupting
the flow of my deathless prose :-).  And I'll precede it with an excerpt
from /var/run/dmesg.boot.

But the thing I was *just* noticing is that in many (though not all)
cases, the reported problems with the wide (ahc1) adapter are preceded
by reported problems with sd0 (ahc0:0:0) -- the drive that contains the
bulk of the OS... and which just happens to be a Quantum Empire 1080S.
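
One way to check that pattern against all of /var/log/messages, rather
than by eye, is sketched below. It is only a rough illustration: the
60-second gap used to separate incidents, the hard-coded year, and the
function names are assumptions made for the example, and the line
pattern is modelled on the log excerpt at the end of this message.

```python
import re
import sys
from datetime import datetime, timedelta

# Matches kernel lines that name a device, for example:
#   Aug 25 03:23:14 shrimp /kernel: sd0(ahc0:0:0): parity error ...
LINE_RE = re.compile(
    r"^(?P<stamp>\w{3}\s+\d+\s+\d\d:\d\d:\d\d)\s+\S+\s+/kernel:\s+"
    r"(?P<dev>[a-z]+\d+)\((?P<ctrl>ahc\d+):\d+:\d+\)"
)


def parse_events(lines, year=1998):
    """Return (timestamp, controller, device) for each line naming a device.

    syslog timestamps carry no year, so one is assumed.
    """
    events = []
    for line in lines:
        m = LINE_RE.match(line)
        if m:
            when = datetime.strptime(
                f"{year} {m.group('stamp')}", "%Y %b %d %H:%M:%S"
            )
            events.append((when, m.group("ctrl"), m.group("dev")))
    return events


def group_incidents(events, gap=timedelta(seconds=60)):
    """Split events into incidents separated by more than `gap` of silence."""
    incidents = []
    for ev in events:
        if incidents and ev[0] - incidents[-1][-1][0] <= gap:
            incidents[-1].append(ev)
        else:
            incidents.append([ev])
    return incidents


def summarize(lines):
    """For each incident involving ahc1, report whether sd0 complained first.

    Returns a list of (time of first ahc1 message, sd0_seen_earlier).
    """
    results = []
    for incident in group_incidents(parse_events(lines)):
        first = next(
            (i for i, ev in enumerate(incident) if ev[1] == "ahc1"), None
        )
        if first is None:
            continue  # incident did not involve the wide adapter
        sd0_first = any(ev[2] == "sd0" for ev in incident[:first])
        results.append((incident[first][0], sd0_first))
    return results


if __name__ == "__main__" and len(sys.argv) > 1:
    with open(sys.argv[1], errors="replace") as fh:
        for when, sd0_first in summarize(fh):
            print(when, "preceded by sd0" if sd0_first else "no sd0 message first")
```

Run with the log file as its only argument. A high share of "preceded
by sd0" lines would support the suspicion, but would not by itself show
that sd0 is the cause.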

Does it make sense to others that perhaps that Quantum Empire might be
triggering the vast bulk of the rest of this unpleasantness?  (I confess
to a certain bias, as I have one at home that was on my Sun 3/60 for a
while; I returned home after being out of town for a couple of weeks,
only to find that there had been a power outage in my absence, and the
drive had succumbed to the "stiction" problem for the last time.)

Any hints or tips welcomed.

Thanks,
david
-- 
David Wolfskill		UNIX System Administrator
dhw@whistle.com		voice: (650) 577-7158	pager: (650) 371-4621


From /var/run/dmesg.boot:

FreeBSD 2.2.6-RELEASE #0: Mon Aug 10 17:11:23 PDT 1998
    dhw@shrimp.whistle.com:/usr/src/sys/compile/SHRIMP
CPU: Pentium Pro (199.31-MHz 686-class CPU)
  Origin = "GenuineIntel"  Id = 0x617  Stepping=7
  Features=0xf9ff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,SEP,MTRR,PGE,MCA,CMOV>
real memory  = 33554432 (32768K bytes)
avail memory = 30638080 (29920K bytes)
Probing for devices on PCI bus 0:
chip0 <Intel 82440FX (Natoma) PCI and memory controller> rev 2 on pci0:0:0
chip1 <Intel 82371SB PCI-ISA bridge> rev 1 on pci0:1:0
chip2 <Intel 82371SB IDE interface> rev 0 on pci0:1:1
pci0:1:2: Intel Corporation, device=0x7020, class=serial, subclass=0x03 int d irq 12 [no driver assigned]
ahc0 <Adaptec 2940 SCSI host adapter> rev 3 int a irq 12 on pci0:9:0
ahc0: aic7870 Single Channel, SCSI Id=7, 16 SCBs
ahc0 waiting for scsi devices to settle
(ahc0:0:0): "QUANTUM FIREBALL1080S 1Q09" type 0 fixed SCSI 2
sd0(ahc0:0:0): Direct-Access 1042MB (2134305 512 byte sectors)
(ahc0:2:0): "HP C3725S 6039" type 0 fixed SCSI 2
sd3(ahc0:2:0): Direct-Access 2047MB (4194058 512 byte sectors)
(ahc0:3:0): "HP C3725S 6039" type 0 fixed SCSI 2
sd4(ahc0:3:0): Direct-Access 2047MB (4194058 512 byte sectors)
(ahc0:4:0): "HP C1533A 9406" type 1 removable SCSI 2
st0(ahc0:4:0): Sequential-Access density code 0x24,  drive empty
(ahc0:5:0): "HP C1533A 9503" type 1 removable SCSI 2
st1(ahc0:5:0): Sequential-Access density code 0x24,  drive empty
(ahc0:6:0): "PLEXTOR CD-ROM PX-6XCS 2.06" type 5 removable SCSI 2
cd0(ahc0:6:0): CD-ROM can't get the size
...
ahc1 <Adaptec 2940 Ultra SCSI host adapter> rev 0 int a irq 11 on pci0:11:0
ahc1: aic7880 Wide Channel, SCSI Id=7, 16 SCBs
ahc1 waiting for scsi devices to settle
(ahc1:1:0): "Quantum XP31070W L912" type 0 fixed SCSI 2
sd1(ahc1:1:0): Direct-Access 1075MB (2203480 512 byte sectors)
(ahc1:2:0): "MICROP 4691WS T171" type 0 fixed SCSI 2
sd2(ahc1:2:0): Direct-Access 8681MB (17780058 512 byte sectors)
(ahc1:4:0): "Quantum DLT4000 D069" type 1 removable SCSI 2
st2(ahc1:4:0): Sequential-Access density code 0x1a, variable blocks, write-enabled
ahc1:A:6: refuses WIDE negotiation.  Using 8bit transfers
(ahc1:6:0): "ADIC FastStor DLT 0110" type 8 removable SCSI 2
ch0(ahc1:6:0): Medium-Changer 7 slots, 1 drive, 1 picker




From /var/log/messages:

Aug 25 03:23:14 shrimp /kernel: sd0(ahc0:0:0): parity error during Data-In phase.
Aug 25 03:23:14 shrimp /kernel: sd0(ahc0:0:0): ABORTED COMMAND asc:48,0
Aug 25 03:23:14 shrimp /kernel: sd0(ahc0:0:0):  Initiator detected error message received
Aug 25 03:23:14 shrimp /kernel: , retries:4
Aug 25 03:23:37 shrimp /kernel: sd1(ahc1:1:0): Target Busy
Aug 25 03:23:47 shrimp /kernel: sd1(ahc1:1:0): SCB 0x4 - timed out in datain phase, SCSISIGI == 0x44
Aug 25 03:23:47 shrimp /kernel: SEQADDR = 0x48 SCSISEQ = 0x12 SSTAT0 = 0x7 SSTAT1 = 0x2
Aug 25 03:23:47 shrimp /kernel: sd1(ahc1:1:0): abort message in message buffer
Aug 25 03:23:47 shrimp /kernel: sd2(ahc1:2:0): SCB 0x0 timedout while recovery in progress
Aug 25 03:23:47 shrimp /kernel: sd2(ahc1:2:0): SCB 0x3 timedout while recovery in progress
Aug 25 03:23:49 shrimp /kernel: sd1(ahc1:1:0): SCB 0x1 - timed out in datain phase, SCSISIGI == 0x54
Aug 25 03:23:49 shrimp /kernel: SEQADDR = 0x48 SCSISEQ = 0x12 SSTAT0 = 0x7 SSTAT1 = 0x2
Aug 25 03:23:49 shrimp /kernel: sd1(ahc1:1:0): no longer in timeout
Aug 25 03:23:49 shrimp /kernel: ahc1: Issued Channel A Bus Reset. 4 SCBs aborted
Aug 25 03:23:49 shrimp /kernel: sd1(ahc1:1:0): UNIT ATTENTION asc:29,2 
Aug 25 03:23:49 shrimp /kernel: , retries:3
Aug 25 03:23:49 shrimp /kernel: sd2(ahc1:2:0): UNIT ATTENTION asc:29,0
Aug 25 03:23:49 shrimp /kernel: sd2(ahc1:2:0):  Power on, reset, or bus device reset occurred
Aug 25 03:23:49 shrimp /kernel: , retries:3
Aug 25 03:23:51 shrimp /kernel: st2(ahc1:4:0): UNIT ATTENTION asc:29,0
Aug 25 03:23:52 shrimp /kernel: st2(ahc1:4:0):  Power on, reset, or bus device reset occurred
Aug 25 03:29:12 shrimp /kernel: sd1(ahc1:1:0): Target Busy
Aug 25 03:29:22 shrimp /kernel: sd1(ahc1:1:0): SCB 0x3 - timed out in datain phase, SCSISIGI == 0x44
Aug 25 03:29:24 shrimp /kernel: SEQADDR = 0x48 SCSISEQ = 0x12 SSTAT0 = 0x7 SSTAT1 = 0x2
Aug 25 03:29:24 shrimp /kernel: sd1(ahc1:1:0): abort message in message buffer
Aug 25 03:29:24 shrimp /kernel: sd2(ahc1:2:0): SCB 0x1 timedout while recovery in progress
Aug 25 03:29:24 shrimp /kernel: sd2(ahc1:2:0): SCB 0x2 timedout while recovery in progress
Aug 25 03:29:24 shrimp /kernel: sd1(ahc1:1:0): SCB 0x4 - timed out in datain phase, SCSISIGI == 0x54
Aug 25 03:29:24 shrimp /kernel: SEQADDR = 0x48 SCSISEQ = 0x12 SSTAT0 = 0x7 SSTAT1 = 0x2
Aug 25 03:29:24 shrimp /kernel: sd1(ahc1:1:0): no longer in timeout
Aug 25 03:29:24 shrimp /kernel: ahc1: Issued Channel A Bus Reset. 4 SCBs aborted
Aug 25 03:29:24 shrimp /kernel: sd1(ahc1:1:0): UNIT ATTENTION asc:29,2 
Aug 25 03:29:24 shrimp /kernel: , retries:2
Aug 25 03:29:24 shrimp /kernel: sd2(ahc1:2:0): UNIT ATTENTION asc:29,0
Aug 25 03:29:24 shrimp /kernel: sd2(ahc1:2:0):  Power on, reset, or bus device reset occurred
Aug 25 03:29:24 shrimp /kernel: , retries:3
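
For reading the SCSISIGI values in the timeout lines above, a small
decoder may help. This is a sketch under assumptions: the bit layout is
the one I recall from the aic7xxx driver's register definitions and
should be checked against the driver source, and the function name is
made up for the example. The phase table is the standard SCSI encoding
of the MSG, C/D and I/O lines.

```python
# SCSISIGI bit names, highest bit first (assumed from the aic7xxx
# register definitions; verify against your source tree).
SCSISIGI_BITS = [
    (0x80, "CD"), (0x40, "IO"), (0x20, "MSG"), (0x10, "ATN"),
    (0x08, "SEL"), (0x04, "BSY"), (0x02, "REQ"), (0x01, "ACK"),
]

# SCSI bus phase keyed by the (MSG, C/D, I/O) signal lines.
PHASES = {
    (0, 0, 0): "Data-Out",
    (0, 0, 1): "Data-In",
    (0, 1, 0): "Command",
    (0, 1, 1): "Status",
    (1, 1, 0): "Message-Out",
    (1, 1, 1): "Message-In",
}


def decode_scsisigi(value):
    """Return (phase name, list of asserted signal names) for a SCSISIGI value."""
    asserted = [name for mask, name in SCSISIGI_BITS if value & mask]
    key = (
        1 if value & 0x20 else 0,  # MSG
        1 if value & 0x80 else 0,  # C/D
        1 if value & 0x40 else 0,  # I/O
    )
    return PHASES.get(key, "reserved"), asserted


if __name__ == "__main__":
    for raw in (0x44, 0x54):
        phase, signals = decode_scsisigi(raw)
        print(f"0x{raw:02x}: phase={phase} asserted={','.join(signals)}")
```

Under those assumptions, 0x44 decodes as Data-In with BSY asserted,
which agrees with the "timed out in datain phase" text, and 0x54 is the
same state with ATN also asserted. ATN is the line an initiator raises
when it wants to send a message, so that reading would be consistent
with the "abort message in message buffer" line, though I would not
treat it as conclusive.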
