Skip site navigation (1)Skip section navigation (2)
Date:      Wed, 25 Oct 2000 15:44:36 +0200 (SAT)
From:      John Hay <jhay@icomtek.csir.co.za>
To:        jhay@icomtek.csir.co.za (John Hay)
Cc:        freebsd-scsi@FreeBSD.ORG
Subject:   Re: ahc still broken?
Message-ID:  <200010251344.e9PDia430080@zibbi.icomtek.csir.co.za>
In-Reply-To: <200010200736.e9K7aiQ58036@zibbi.mikom.csir.co.za> from John Hay at "Oct 20, 2000 09:36:44 am"

next in thread | previous in thread | raw e-mail | index | archive | help
Well here is an update on the problem. I left everything at -stable and
just took sys/dev/aic7xxx back to 2000-09-15 and adjusted conf/files
to cater for that, built a new kernel and my problems are gone. I have
made 5 releases now and haven't had a single problem.

John
-- 
John Hay -- John.Hay@icomtek.csir.co.za

> Hi,
> 
> Is the ahc driver still broken in -stable?
> 
> I have a Dell machine that have been building -stable snapshosts every
> night for months without a problem (using a kernel and user level of
> April 14). Two days ago we had a prower failure and I thought it might
> be a good time to upgrade the machine to the latest -stable, but now
> it gives scsi errors and panic when building the -stable snapshots.
> Previously I haven't seen a single scsi error and the machine had been
> up for months at a time.
> 
> Here is the console output of the boot, scsi errors and panic I have
> captured through the serial port.
> 
> John
> --
> John Hay -- John.Hay@icomtek.csir.co.za
> 
> Console: serial port
> BIOS drive A: is disk0
> BIOS drive C: is disk1
> BIOS 640kB/130040kB available memory
> 
> FreeBSD/i386 bootstrap loader, Revision 0.8
> (jhay@dolphin.mikom.csir.co.za, Wed Oct 18 13:30:42 SAST 2000)
> Loading /boot/defaults/loader.conf 
> /kernel text=0x1824a1 data=0x427e0+0x1bc18 syms=[0x4+0x29420+0x4+0x2d924]
> 
> Hit [Enter] to boot immediately, or any other key for command prompt.
> Copyright (c) 1992-2000 The FreeBSD Project.
> Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
> 	The Regents of the University of California. All rights reserved.
> FreeBSD 4.1.1-STABLE #0: Wed Oct 18 14:15:22 SAST 2000
>     jhay@dolphin.mikom.csir.co.za:/usr/src/sys/compile/DOLPHIN
> Timecounter "i8254"  frequency 1193146 Hz
> Timecounter "TSC"  frequency 348916883 Hz
> CPU: Pentium II/Pentium II Xeon/Celeron (348.92-MHz 686-class CPU)
>   Origin = "GenuineIntel"  Id = 0x652  Stepping = 2
>   Features=0x183f9ff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,MMX,FXSR>
> real memory  = 134209536 (131064K bytes)
> avail memory = 127447040 (124460K bytes)
> Preloaded elf kernel "kernel" at 0xc033a000.
> Pentium Pro MTRR support enabled
> npx0: <math processor> on motherboard
> npx0: INT 16 interface
> pcib0: <Intel 82443BX (440 BX) host to PCI bridge> on motherboard
> pci0: <PCI bus> on pcib0
> pcib1: <Intel 82443BX (440 BX) PCI-PCI (AGP) bridge> at device 1.0 on pci0
> pci1: <PCI bus> on pcib1
> pci1: <ATI Mach64-GB graphics accelerator> at 0.0 irq 11
> isab0: <Intel 82371AB PCI to ISA bridge> at device 7.0 on pci0
> isa0: <ISA bus> on isab0
> atapci0: <Intel PIIX4 ATA33 controller> port 0xffa0-0xffaf at device 7.1 on pci0
> ata1: at 0x170 irq 15 on atapci0
> pci0: <Intel 82371AB/EB (PIIX4) USB controller> at 7.2 irq 11
> intpm0: <Intel 82371AB Power management controller> port 0x850-0x85f irq 9 at device 7.3 on pci0
> intpm0: I/O mapped 850
> intpm0: intr IRQ 9 enabled revision 0
> smbus0: <System Management Bus> on intsmb0
> smb0: <SMBus general purpose I/O> on smbus0
> intpm0: PM I/O mapped 800 
> pcib2: <DEC 21152 PCI-PCI bridge> at device 15.0 on pci0
> pci2: <PCI bus> on pcib2
> ahc0: <Adaptec 2940/DUAL Ultra SCSI adapter> port 0xdc00-0xdcff mem 0xfafff000-0xfaffffff irq 11 at device 10.0 on pci2
> aic7895: Wide Channel A, SCSI Id=7, 32/255 SCBs
> ahc1: <Adaptec 2940/DUAL Ultra SCSI adapter> port 0xd800-0xd8ff mem 0xfaffe000-0xfaffefff irq 11 at device 10.1 on pci2
> aic7895: Single Channel B, SCSI Id=7, 32/255 SCBs
> xl0: <3Com 3c905B-TX Fast Etherlink XL> port 0xcc00-0xcc7f mem 0xff000000-0xff00007f irq 11 at device 17.0 on pci0
> xl0: Ethernet address: 00:c0:4f:71:b3:ab
> miibus0: <MII bus> on xl0
> xlphy0: <3Com internal media interface> on miibus0
> xlphy0:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
> fdc0: <NEC 72065B or clone> at port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on isa0
> fdc0: FIFO enabled, 8 bytes threshold
> fd0: <1440-KB 3.5" drive> on fdc0 drive 0
> atkbdc0: <Keyboard controller (i8042)> at port 0x60,0x64 on isa0
> atkbd0: <AT Keyboard> flags 0x1 irq 1 on atkbdc0
> kbd0 at atkbd0
> psm0: <PS/2 Mouse> irq 12 on atkbdc0
> psm0: model Generic PS/2 mouse, device ID 0
> vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
> sc0: <System console> at flags 0x100 on isa0
> sc0: VGA <16 virtual consoles, flags=0x100>
> sio0 at port 0x3f8-0x3ff irq 4 flags 0x10 on isa0
> sio0: type 16550A, console
> sio1 at port 0x2f8-0x2ff irq 3 on isa0
> sio1: type 16550A
> ppc0: <Parallel port> at port 0x378-0x37f irq 7 on isa0
> ppc0: SMC-like chipset (ECP/EPP/PS2/NIBBLE) in COMPATIBLE mode
> ppc0: FIFO with 16/16/8 bytes threshold
> lpt0: <Printer> on ppbus0
> lpt0: Interrupt-driven port
> ppi0: <Parallel I/O> on ppbus0
> plip0: <PLIP network interface> on ppbus0
> pcm0: <CS423x> at port 0x534-0x537,0x388-0x38b,0x220-0x22f irq 5 drq 1,0 on isa0
> acd0: CDROM <TOSHIBA CD-ROM XM-6202B> at ata1-master using WDMA2
> Waiting 2 seconds for SCSI devices to settle
> Mounting root from ufs:/dev/da0s1a
> da0 at ahc0 bus 0 target 0 lun 0
> da0: <QUANTUM VIKING II 9.1WLS 5520> Fixed Direct Access SCSI-2 device 
> da0: 40.000MB/s transfers (20.000MHz, offset 8, 16bit), Tagged Queueing Enabled
> da0: 8709MB (17836668 512 byte sectors: 255H 63S/T 1110C)
> WARNING: / was not properly dismounted
> swapon: adding /dev/da0s1b as swap device
> Automatic boot in progress...
> /dev/da0s1a: 1443 files, 38661 used, 60522 free (522 frags, 7500 blocks, 0.5% fragmentation)
> /dev/da0s1f: 101065 files, 1289928 used, 742695 free (41839 frags, 87607 blocks, 2.1% fragmentation)
> /dev/da0s1g: UNREF FILE I=79410  OWNER=root MODE=100555
> /dev/da0s1g: SIZE=445920 MTIME=Oct 19 06:10 2000  (CLEARED)
> /dev/da0s1g: UNREF FILE I=517315  OWNER=root MODE=100644
> /dev/da0s1g: SIZE=9311 MTIME=Oct 19 10:02 2000  (CLEARED)
> /dev/da0s1g: FREE BLK COUNT(S) WRONG IN SUPERBLK (SALVAGED)
> /dev/da0s1g: SUMMARY INFORMATION BAD (SALVAGED)
> /dev/da0s1g: BLK(S) MISSING IN BIT MAPS (SALVAGED)
> /dev/da0s1g: 103481 files, 1421325 used, 1627617 free (26329 frags, 200161 blocks, 0.9% fragmentation)
> /dev/da0s1h: 142048 files, 1809221 used, 1057687 free (5335 frags, 131544 blocks, 0.2% fragmentation)
> /dev/da0s1e: 900 files, 29997 used, 69186 free (1282 frags, 8488 blocks, 1.3% fragmentation)
> Doing initial network setup: hostname.
> xl0: flags=8843<UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST> mtu 1500
> 	inet 146.64.28.14 netmask 0xffffff00 broadcast 146.64.28.255
> 	inet6 fe80::2c0:4fff:fe71:b3ab%xl0 prefixlen 64 tentative scopeid 0x1 
> 	ether 00:c0:4f:71:b3:ab 
> 	media: autoselect (100baseTX <full-duplex>) status: active
> 	supported media: autoselect 100baseTX <full-duplex> 100baseTX 10baseT/UTP <full-duplex> 10baseT/UTP 100baseTX <hw-loopback>
> lo0: flags=8049<UP,LOOPBACK,RUNNING,MULTICAST> mtu 16384
> 	inet6 fe80::1%lo0 prefixlen 64 scopeid 0x9 
> 	inet6 ::1 prefixlen 128 
> 	inet 127.0.0.1 netmask 0xff000000 
> add net default: gateway 146.64.28.30
> Additional routing options: tcp extensions=NO TCP keepalive=YES.
> routing daemons:.
> Doing IPv6 network setup:add net ::ffff:0.0.0.0: gateway ::1
> add net ::0.0.0.0: gateway ::1
> net.inet6.ip6.forwarding: 0 -> 0
> net.inet6.ip6.accept_rtadv: 0 -> 1
> add net fe80::: gateway fe80::2c0:4fff:fe71:b3ab%xl0
> add net ff02::: gateway fe80::2c0:4fff:fe71:b3ab%xl0
>  IPv4 mapped IPv6 address support=YES.
> clearing /tmp
> additional daemons: syslogd.
> checking for core dump...savecore: no core dump
> Doing additional network setup: ntpd.
> Starting final network daemons:.
> setting ELF ldconfig path: /usr/lib /usr/lib/compat /usr/X11R6/lib /usr/local/lib
> setting a.out ldconfig path: /usr/lib/aout /usr/lib/compat/aout /usr/X11R6/lib/aout
> starting standard daemons: cron printer sendmail sshd.
> Initial rc.i386 initialization:.
> rc.i386 configuring syscons: blank_time moused.
> additional ABI support: linux.
> starting local daemons:starting local daemons:.
> .
> Local package initialization:.
> Additional TCP options:.
> Thu Oct 19 10:20:05 SAST 2000
> Oct 19 10:58:31 dolphin /kernel: /mnt: optimization changed from SPACE to TIME
> Oct 19 10:59:40 dolphin /kernel: /mnt: optimization changed from SPACE to TIME
> Oct 19 11:00:39 dolphin /kernel: /mnt: optimization changed from SPACE to TIME
> Oct 19 11:01:46 dolphin /kernel: /mnt: optimization changed from SPACE to TIME
> Oct 19 15:20:41 dolphin /kernel: /mnt: optimization changed from SPACE to TIME
> Oct 19 15:28:08 dolphin /kernel: /mnt: optimization changed from SPACE to TIME
> Oct 19 15:29:24 dolphin last message repeated 2 times
> 
> (da0:ahc0:0:0:0): data overrun detected in Data-out phase.  Tag == 0x32.
> (da0:ahc0:0:0:0): Have seen Data Phase.  Length = 65536.  NumSGs = 16.
> sg[0] - Addr 0x328e000 : Length 4096
> sg[1] - Addr 0x440f000 : Length 4096
> sg[2] - Addr 0x2f70000 : Length 4096
> sg[3] - Addr 0xd91000 : Length 4096
> sg[4] - Addr 0x3872000 : Length 4096
> sg[5] - Addr 0x9f3000 : Length 4096
> sg[6] - Addr 0x4394000 : Length 4096
> sg[7] - Addr 0x3b55000 : Length 4096
> sg[8] - Addr 0xcb6000 : Length 4096
> sg[9] - Addr 0x33b7000 : Length 4096
> sg[10] - Addr 0x40f8000 : Length 4096
> sg[11] - Addr 0x2879000 : Length 4096
> sg[12] - Addr 0x6d3a000 : Length 4096
> sg[13] - Addr 0x56db000 : Length 4096
> sg[14] - Addr 0x3d7c000 : Length 4096
> sg[15] - Addr 0x735d000 : Length 4096
> (da0:ahc0:0:0:0): data overrun detected in Data-out phase.  Tag == 0x32.
> (da0:ahc0:0:0:0): Have seen Data Phase.  Length = 65536.  NumSGs = 16.
> sg[0] - Addr 0x328e000 : Length 4096
> sg[1] - Addr 0x440f000 : Length 4096
> sg[2] - Addr 0x2f70000 : Length 4096
> sg[3] - Addr 0xd91000 : Length 4096
> sg[4] - Addr 0x3872000 : Length 4096
> sg[5] - Addr 0x9f3000 : Length 4096
> sg[6] - Addr 0x4394000 : Length 4096
> sg[7] - Addr 0x3b55000 : Length 4096
> sg[8] - Addr 0xcb6000 : Length 4096
> sg[9] - Addr 0x33b7000 : Length 4096
> sg[10] - Addr 0x40f8000 : Length 4096
> sg[11] - Addr 0x2879000 : Length 4096
> sg[12] - Addr 0x6d3a000 : Length 4096
> sg[13] - Addr 0x56db000 : Length 4096
> sg[14] - Addr 0x3d7c000 : Length 4096
> sg[15] - Addr 0x735d000 : Length 4096
> (da0:ahc0:0:0:0): data overrun detected in Data-out phase.  Tag == 0x4.
> (da0:ahc0:0:0:0): Have seen Data Phase.  Length = 65536.  NumSGs = 16.
> sg[0] - Addr 0x12fe000 : Length 4096
> sg[1] - Addr 0x61bf000 : Length 4096
> sg[2] - Addr 0x6a80000 : Length 4096
> sg[3] - Addr 0x67a1000 : Length 4096
> sg[4] - Addr 0x2f22000 : Length 4096
> sg[5] - Addr 0x6e63000 : Length 4096
> sg[6] - Addr 0x3d04000 : Length 4096
> sg[7] - Addr 0x40c5000 : Length 4096
> sg[8] - Addr 0x45c6000 : Length 4096
> sg[9] - Addr 0x61a7000 : Length 4096
> sg[10] - Addr 0x7b48000 : Length 4096
> sg[11] - Addr 0x1009000 : Length 4096
> sg[12] - Addr 0x44ea000 : Length 4096
> sg[13] - Addr 0x49eb000 : Length 4096
> sg[14] - Addr 0x16cc000 : Length 4096
> sg[15] - Addr 0xded000 : Length 4096
> (da0:ahc0:0:0:0): data overrun detected in Data-out phase.  Tag == 0x30.
> (da0:ahc0:0:0:0): Have seen Data Phase.  Length = 65536.  NumSGs = 16.
> sg[0] - Addr 0x352e000 : Length 4096
> sg[1] - Addr 0x1aef000 : Length 4096
> sg[2] - Addr 0x3930000 : Length 4096
> sg[3] - Addr 0x3ff1000 : Length 4096
> sg[4] - Addr 0x71b2000 : Length 4096
> sg[5] - Addr 0x63b3000 : Length 4096
> sg[6] - Addr 0x2b14000 : Length 4096
> sg[7] - Addr 0x56b5000 : Length 4096
> sg[8] - Addr 0x77f6000 : Length 4096
> sg[9] - Addr 0x69f7000 : Length 4096
> sg[10] - Addr 0x6538000 : Length 4096
> sg[11] - Addr 0x54d9000 : Length 4096
> sg[12] - Addr 0x4c5a000 : Length 4096
> sg[13] - Addr 0x141b000 : Length 4096
> sg[14] - Addr 0x765c000 : Length 4096
> sg[15] - Addr 0x4f1d000 : Length 4096
> (da0:ahc0:0:0:0): data overrun detected in Data-out phase.  Tag == 0x3a.
> (da0:ahc0:0:0:0): Have seen Data Phase.  Length = 65536.  NumSGs = 16.
> sg[0] - Addr 0x793e000 : Length 4096
> sg[1] - Addr 0x1c3f000 : Length 4096
> sg[2] - Addr 0xc40000 : Length 4096
> sg[3] - Addr 0x2c61000 : Length 4096
> sg[4] - Addr 0x7662000 : Length 4096
> sg[5] - Addr 0x4163000 : Length 4096
> sg[6] - Addr 0x78c4000 : Length 4096
> sg[7] - Addr 0x1685000 : Length 4096
> sg[8] - Addr 0x4906000 : Length 4096
> sg[9] - Addr 0x2927000 : Length 4096
> sg[10] - Addr 0x14c8000 : Length 4096
> sg[11] - Addr 0x6e69000 : Length 4096
> sg[12] - Addr 0x788a000 : Length 4096
> sg[13] - Addr 0x1f6b000 : Length 4096
> sg[14] - Addr 0x6fcc000 : Length 4096
> sg[15] - Addr 0x366d000 : Length 4096
> SCB 0x27 - timed out in Message-in phase, SEQADDR == 0x15b
> sg[0] - Addr 0x322e000 : Length 4096
> sg[1] - Addr 0x730f000 : Length 4096
> sg[2] - Addr 0x35d0000 : Length 4096
> sg[3] - Addr 0x6b71000 : Length 4096
> sg[4] - Addr 0x5432000 : Length 4096
> sg[5] - Addr 0x4453000 : Length 4096
> sg[6] - Addr 0x3db4000 : Length 4096
> sg[7] - Addr 0x6615000 : Length 4096
> sg[8] - Addr 0x4556000 : Length 4096
> sg[9] - Addr 0x5c77000 : Length 4096
> sg[10] - Addr 0x98000 : Length 4096
> sg[11] - Addr 0x2099000 : Length 4096
> sg[12] - Addr 0x45da000 : Length 4096
> sg[13] - Addr 0x263b000 : Length 4096
> sg[14] - Addr 0x205c000 : Length 4096
> sg[15] - Addr 0x52bd000 : Length 4096
> (da0:ahc0:0:0:0): BDR message in message buffer
> SCB 0x30 - timed out in Message-in phase, SEQADDR == 0x159
> sg[0] - Addr 0x4d6b000 : Length 3072
> panic: Disconnected List inconsistency. SCB index == 255, yet numscbs == 100.
> 
> syncing disks... 
> 
> Fatal trap 12: page fault while in kernel mode
> fault virtual address	= 0x30
> fault code		= supervisor read, page not present
> instruction pointer	= 0x8:0xc01e4500
> stack pointer	        = 0x10:0xc0285224
> frame pointer	        = 0x10:0xc0285228
> code segment		= base 0x0, limit 0xfffff, type 0x1b
> 			= DPL 0, pres 1, def32 1, gran 1
> processor eflags	= interrupt enabled, resume, IOPL = 0
> current process		= Idle
> interrupt mask		= bio cam 
> trap number		= 12
> panic: page fault
> Uptime: 19h12m3s
> (da0:ahc0:0:0:0): Synchronize cache failed, status == 0xb, scsi status == 0x0
> 
> dumping to dev #da/0x20001, offset 761872
> dump Aborting dump due to I/O error.
> status == 0xb, scsi status == 0x0
> failed, reason: i/o error
> Automatic reboot in 15 seconds - press a key on the console to abort
> Rebooting...
> 


To Unsubscribe: send mail to majordomo@FreeBSD.org
with "unsubscribe freebsd-scsi" in the body of the message




Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?200010251344.e9PDia430080>