From owner-freebsd-scsi Wed Oct 25 6:45: 9 2000
Delivered-To: freebsd-scsi@freebsd.org
Received: from zibbi.icomtek.csir.co.za (zibbi.icomtek.csir.co.za [146.64.24.58])
    by hub.freebsd.org (Postfix) with ESMTP id 399C637B4D7
    for ; Wed, 25 Oct 2000 06:44:50 -0700 (PDT)
Received: (from jhay@localhost)
    by zibbi.icomtek.csir.co.za (8.11.0/8.11.0) id e9PDia430080;
    Wed, 25 Oct 2000 15:44:36 +0200 (SAT)
    (envelope-from jhay)
From: John Hay
Message-Id: <200010251344.e9PDia430080@zibbi.icomtek.csir.co.za>
Subject: Re: ahc still broken?
In-Reply-To: <200010200736.e9K7aiQ58036@zibbi.mikom.csir.co.za> from John Hay at "Oct 20, 2000 09:36:44 am"
To: jhay@icomtek.csir.co.za (John Hay)
Date: Wed, 25 Oct 2000 15:44:36 +0200 (SAT)
Cc: freebsd-scsi@FreeBSD.ORG
X-Mailer: ELM [version 2.4ME+ PL54 (25)]
MIME-Version: 1.0
Content-Type: text/plain; charset=US-ASCII
Content-Transfer-Encoding: 7bit
Sender: owner-freebsd-scsi@FreeBSD.ORG
Precedence: bulk
X-Loop: FreeBSD.org

Well, here is an update on the problem. I left everything else at -stable,
took only sys/dev/aic7xxx back to 2000-09-15, adjusted conf/files to cater
for that, and built a new kernel. My problems are gone. I have made 5
releases now and haven't had a single problem.

John
-- 
John Hay -- John.Hay@icomtek.csir.co.za

> Hi,
>
> Is the ahc driver still broken in -stable?
>
> I have a Dell machine that has been building -stable snapshots every
> night for months without a problem (using a kernel and user level of
> April 14). Two days ago we had a power failure and I thought it might
> be a good time to upgrade the machine to the latest -stable, but now
> it gives SCSI errors and panics when building the -stable snapshots.
> Previously I hadn't seen a single SCSI error and the machine had been
> up for months at a time.
>
> Here is the console output of the boot, SCSI errors and panic that I
> captured through the serial port.
>
> John
> -- 
> John Hay -- John.Hay@icomtek.csir.co.za
>
> Console: serial port
> BIOS drive A: is disk0
> BIOS drive C: is disk1
> BIOS 640kB/130040kB available memory
>
> FreeBSD/i386 bootstrap loader, Revision 0.8
> (jhay@dolphin.mikom.csir.co.za, Wed Oct 18 13:30:42 SAST 2000)
> Loading /boot/defaults/loader.conf
> /kernel text=0x1824a1 data=0x427e0+0x1bc18 syms=[0x4+0x29420+0x4+0x2d924]
>
> Hit [Enter] to boot immediately, or any other key for command prompt.
> Copyright (c) 1992-2000 The FreeBSD Project.
> Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
> The Regents of the University of California. All rights reserved.
> FreeBSD 4.1.1-STABLE #0: Wed Oct 18 14:15:22 SAST 2000
> jhay@dolphin.mikom.csir.co.za:/usr/src/sys/compile/DOLPHIN
> Timecounter "i8254" frequency 1193146 Hz
> Timecounter "TSC" frequency 348916883 Hz
> CPU: Pentium II/Pentium II Xeon/Celeron (348.92-MHz 686-class CPU)
> Origin = "GenuineIntel" Id = 0x652 Stepping = 2
> Features=0x183f9ff
> real memory = 134209536 (131064K bytes)
> avail memory = 127447040 (124460K bytes)
> Preloaded elf kernel "kernel" at 0xc033a000.
> Pentium Pro MTRR support enabled
> npx0: on motherboard
> npx0: INT 16 interface
> pcib0: on motherboard
> pci0: on pcib0
> pcib1: at device 1.0 on pci0
> pci1: on pcib1
> pci1: at 0.0 irq 11
> isab0: at device 7.0 on pci0
> isa0: on isab0
> atapci0: port 0xffa0-0xffaf at device 7.1 on pci0
> ata1: at 0x170 irq 15 on atapci0
> pci0: at 7.2 irq 11
> intpm0: port 0x850-0x85f irq 9 at device 7.3 on pci0
> intpm0: I/O mapped 850
> intpm0: intr IRQ 9 enabled revision 0
> smbus0: on intsmb0
> smb0: on smbus0
> intpm0: PM I/O mapped 800
> pcib2: at device 15.0 on pci0
> pci2: on pcib2
> ahc0: port 0xdc00-0xdcff mem 0xfafff000-0xfaffffff irq 11 at device 10.0 on pci2
> aic7895: Wide Channel A, SCSI Id=7, 32/255 SCBs
> ahc1: port 0xd800-0xd8ff mem 0xfaffe000-0xfaffefff irq 11 at device 10.1 on pci2
> aic7895: Single Channel B, SCSI Id=7, 32/255 SCBs
> xl0: <3Com 3c905B-TX Fast Etherlink XL> port 0xcc00-0xcc7f mem 0xff000000-0xff00007f irq 11 at device 17.0 on pci0
> xl0: Ethernet address: 00:c0:4f:71:b3:ab
> miibus0: on xl0
> xlphy0: <3Com internal media interface> on miibus0
> xlphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
> fdc0: at port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on isa0
> fdc0: FIFO enabled, 8 bytes threshold
> fd0: <1440-KB 3.5" drive> on fdc0 drive 0
> atkbdc0: at port 0x60,0x64 on isa0
> atkbd0: flags 0x1 irq 1 on atkbdc0
> kbd0 at atkbd0
> psm0: irq 12 on atkbdc0
> psm0: model Generic PS/2 mouse, device ID 0
> vga0: at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
> sc0: at flags 0x100 on isa0
> sc0: VGA <16 virtual consoles, flags=0x100>
> sio0 at port 0x3f8-0x3ff irq 4 flags 0x10 on isa0
> sio0: type 16550A, console
> sio1 at port 0x2f8-0x2ff irq 3 on isa0
> sio1: type 16550A
> ppc0: at port 0x378-0x37f irq 7 on isa0
> ppc0: SMC-like chipset (ECP/EPP/PS2/NIBBLE) in COMPATIBLE mode
> ppc0: FIFO with 16/16/8 bytes threshold
> lpt0: on ppbus0
> lpt0: Interrupt-driven port
> ppi0: on ppbus0
> plip0: on ppbus0
> pcm0: at port 0x534-0x537,0x388-0x38b,0x220-0x22f irq 5 drq 1,0 on isa0
> acd0: CDROM at ata1-master using WDMA2
> Waiting 2 seconds for SCSI devices to settle
> Mounting root from ufs:/dev/da0s1a
> da0 at ahc0 bus 0 target 0 lun 0
> da0: Fixed Direct Access SCSI-2 device
> da0: 40.000MB/s transfers (20.000MHz, offset 8, 16bit), Tagged Queueing Enabled
> da0: 8709MB (17836668 512 byte sectors: 255H 63S/T 1110C)
> WARNING: / was not properly dismounted
> swapon: adding /dev/da0s1b as swap device
> Automatic boot in progress...
> /dev/da0s1a: 1443 files, 38661 used, 60522 free (522 frags, 7500 blocks, 0.5% fragmentation)
> /dev/da0s1f: 101065 files, 1289928 used, 742695 free (41839 frags, 87607 blocks, 2.1% fragmentation)
> /dev/da0s1g: UNREF FILE I=79410 OWNER=root MODE=100555
> /dev/da0s1g: SIZE=445920 MTIME=Oct 19 06:10 2000 (CLEARED)
> /dev/da0s1g: UNREF FILE I=517315 OWNER=root MODE=100644
> /dev/da0s1g: SIZE=9311 MTIME=Oct 19 10:02 2000 (CLEARED)
> /dev/da0s1g: FREE BLK COUNT(S) WRONG IN SUPERBLK (SALVAGED)
> /dev/da0s1g: SUMMARY INFORMATION BAD (SALVAGED)
> /dev/da0s1g: BLK(S) MISSING IN BIT MAPS (SALVAGED)
> /dev/da0s1g: 103481 files, 1421325 used, 1627617 free (26329 frags, 200161 blocks, 0.9% fragmentation)
> /dev/da0s1h: 142048 files, 1809221 used, 1057687 free (5335 frags, 131544 blocks, 0.2% fragmentation)
> /dev/da0s1e: 900 files, 29997 used, 69186 free (1282 frags, 8488 blocks, 1.3% fragmentation)
> Doing initial network setup: hostname.
> xl0: flags=8843 mtu 1500
> inet 146.64.28.14 netmask 0xffffff00 broadcast 146.64.28.255
> inet6 fe80::2c0:4fff:fe71:b3ab%xl0 prefixlen 64 tentative scopeid 0x1
> ether 00:c0:4f:71:b3:ab
> media: autoselect (100baseTX ) status: active
> supported media: autoselect 100baseTX 100baseTX 10baseT/UTP 10baseT/UTP 100baseTX
> lo0: flags=8049 mtu 16384
> inet6 fe80::1%lo0 prefixlen 64 scopeid 0x9
> inet6 ::1 prefixlen 128
> inet 127.0.0.1 netmask 0xff000000
> add net default: gateway 146.64.28.30
> Additional routing options: tcp extensions=NO TCP keepalive=YES.
> routing daemons:.
> Doing IPv6 network setup:add net ::ffff:0.0.0.0: gateway ::1
> add net ::0.0.0.0: gateway ::1
> net.inet6.ip6.forwarding: 0 -> 0
> net.inet6.ip6.accept_rtadv: 0 -> 1
> add net fe80::: gateway fe80::2c0:4fff:fe71:b3ab%xl0
> add net ff02::: gateway fe80::2c0:4fff:fe71:b3ab%xl0
> IPv4 mapped IPv6 address support=YES.
> clearing /tmp
> additional daemons: syslogd.
> checking for core dump...savecore: no core dump
> Doing additional network setup: ntpd.
> Starting final network daemons:.
> setting ELF ldconfig path: /usr/lib /usr/lib/compat /usr/X11R6/lib /usr/local/lib
> setting a.out ldconfig path: /usr/lib/aout /usr/lib/compat/aout /usr/X11R6/lib/aout
> starting standard daemons: cron printer sendmail sshd.
> Initial rc.i386 initialization:.
> rc.i386 configuring syscons: blank_time moused.
> additional ABI support: linux.
> starting local daemons:starting local daemons:.
> .
> Local package initialization:.
> Additional TCP options:.
> Thu Oct 19 10:20:05 SAST 2000
> Oct 19 10:58:31 dolphin /kernel: /mnt: optimization changed from SPACE to TIME
> Oct 19 10:59:40 dolphin /kernel: /mnt: optimization changed from SPACE to TIME
> Oct 19 11:00:39 dolphin /kernel: /mnt: optimization changed from SPACE to TIME
> Oct 19 11:01:46 dolphin /kernel: /mnt: optimization changed from SPACE to TIME
> Oct 19 15:20:41 dolphin /kernel: /mnt: optimization changed from SPACE to TIME
> Oct 19 15:28:08 dolphin /kernel: /mnt: optimization changed from SPACE to TIME
> Oct 19 15:29:24 dolphin last message repeated 2 times
>
> (da0:ahc0:0:0:0): data overrun detected in Data-out phase. Tag == 0x32.
> (da0:ahc0:0:0:0): Have seen Data Phase. Length = 65536. NumSGs = 16.
> sg[0] - Addr 0x328e000 : Length 4096
> sg[1] - Addr 0x440f000 : Length 4096
> sg[2] - Addr 0x2f70000 : Length 4096
> sg[3] - Addr 0xd91000 : Length 4096
> sg[4] - Addr 0x3872000 : Length 4096
> sg[5] - Addr 0x9f3000 : Length 4096
> sg[6] - Addr 0x4394000 : Length 4096
> sg[7] - Addr 0x3b55000 : Length 4096
> sg[8] - Addr 0xcb6000 : Length 4096
> sg[9] - Addr 0x33b7000 : Length 4096
> sg[10] - Addr 0x40f8000 : Length 4096
> sg[11] - Addr 0x2879000 : Length 4096
> sg[12] - Addr 0x6d3a000 : Length 4096
> sg[13] - Addr 0x56db000 : Length 4096
> sg[14] - Addr 0x3d7c000 : Length 4096
> sg[15] - Addr 0x735d000 : Length 4096
> (da0:ahc0:0:0:0): data overrun detected in Data-out phase. Tag == 0x32.
> (da0:ahc0:0:0:0): Have seen Data Phase. Length = 65536. NumSGs = 16.
> sg[0] - Addr 0x328e000 : Length 4096
> sg[1] - Addr 0x440f000 : Length 4096
> sg[2] - Addr 0x2f70000 : Length 4096
> sg[3] - Addr 0xd91000 : Length 4096
> sg[4] - Addr 0x3872000 : Length 4096
> sg[5] - Addr 0x9f3000 : Length 4096
> sg[6] - Addr 0x4394000 : Length 4096
> sg[7] - Addr 0x3b55000 : Length 4096
> sg[8] - Addr 0xcb6000 : Length 4096
> sg[9] - Addr 0x33b7000 : Length 4096
> sg[10] - Addr 0x40f8000 : Length 4096
> sg[11] - Addr 0x2879000 : Length 4096
> sg[12] - Addr 0x6d3a000 : Length 4096
> sg[13] - Addr 0x56db000 : Length 4096
> sg[14] - Addr 0x3d7c000 : Length 4096
> sg[15] - Addr 0x735d000 : Length 4096
> (da0:ahc0:0:0:0): data overrun detected in Data-out phase. Tag == 0x4.
> (da0:ahc0:0:0:0): Have seen Data Phase. Length = 65536. NumSGs = 16.
> sg[0] - Addr 0x12fe000 : Length 4096
> sg[1] - Addr 0x61bf000 : Length 4096
> sg[2] - Addr 0x6a80000 : Length 4096
> sg[3] - Addr 0x67a1000 : Length 4096
> sg[4] - Addr 0x2f22000 : Length 4096
> sg[5] - Addr 0x6e63000 : Length 4096
> sg[6] - Addr 0x3d04000 : Length 4096
> sg[7] - Addr 0x40c5000 : Length 4096
> sg[8] - Addr 0x45c6000 : Length 4096
> sg[9] - Addr 0x61a7000 : Length 4096
> sg[10] - Addr 0x7b48000 : Length 4096
> sg[11] - Addr 0x1009000 : Length 4096
> sg[12] - Addr 0x44ea000 : Length 4096
> sg[13] - Addr 0x49eb000 : Length 4096
> sg[14] - Addr 0x16cc000 : Length 4096
> sg[15] - Addr 0xded000 : Length 4096
> (da0:ahc0:0:0:0): data overrun detected in Data-out phase. Tag == 0x30.
> (da0:ahc0:0:0:0): Have seen Data Phase. Length = 65536. NumSGs = 16.
> sg[0] - Addr 0x352e000 : Length 4096
> sg[1] - Addr 0x1aef000 : Length 4096
> sg[2] - Addr 0x3930000 : Length 4096
> sg[3] - Addr 0x3ff1000 : Length 4096
> sg[4] - Addr 0x71b2000 : Length 4096
> sg[5] - Addr 0x63b3000 : Length 4096
> sg[6] - Addr 0x2b14000 : Length 4096
> sg[7] - Addr 0x56b5000 : Length 4096
> sg[8] - Addr 0x77f6000 : Length 4096
> sg[9] - Addr 0x69f7000 : Length 4096
> sg[10] - Addr 0x6538000 : Length 4096
> sg[11] - Addr 0x54d9000 : Length 4096
> sg[12] - Addr 0x4c5a000 : Length 4096
> sg[13] - Addr 0x141b000 : Length 4096
> sg[14] - Addr 0x765c000 : Length 4096
> sg[15] - Addr 0x4f1d000 : Length 4096
> (da0:ahc0:0:0:0): data overrun detected in Data-out phase. Tag == 0x3a.
> (da0:ahc0:0:0:0): Have seen Data Phase. Length = 65536. NumSGs = 16.
> sg[0] - Addr 0x793e000 : Length 4096
> sg[1] - Addr 0x1c3f000 : Length 4096
> sg[2] - Addr 0xc40000 : Length 4096
> sg[3] - Addr 0x2c61000 : Length 4096
> sg[4] - Addr 0x7662000 : Length 4096
> sg[5] - Addr 0x4163000 : Length 4096
> sg[6] - Addr 0x78c4000 : Length 4096
> sg[7] - Addr 0x1685000 : Length 4096
> sg[8] - Addr 0x4906000 : Length 4096
> sg[9] - Addr 0x2927000 : Length 4096
> sg[10] - Addr 0x14c8000 : Length 4096
> sg[11] - Addr 0x6e69000 : Length 4096
> sg[12] - Addr 0x788a000 : Length 4096
> sg[13] - Addr 0x1f6b000 : Length 4096
> sg[14] - Addr 0x6fcc000 : Length 4096
> sg[15] - Addr 0x366d000 : Length 4096
> SCB 0x27 - timed out in Message-in phase, SEQADDR == 0x15b
> sg[0] - Addr 0x322e000 : Length 4096
> sg[1] - Addr 0x730f000 : Length 4096
> sg[2] - Addr 0x35d0000 : Length 4096
> sg[3] - Addr 0x6b71000 : Length 4096
> sg[4] - Addr 0x5432000 : Length 4096
> sg[5] - Addr 0x4453000 : Length 4096
> sg[6] - Addr 0x3db4000 : Length 4096
> sg[7] - Addr 0x6615000 : Length 4096
> sg[8] - Addr 0x4556000 : Length 4096
> sg[9] - Addr 0x5c77000 : Length 4096
> sg[10] - Addr 0x98000 : Length 4096
> sg[11] - Addr 0x2099000 : Length 4096
> sg[12] - Addr 0x45da000 : Length 4096
> sg[13] - Addr 0x263b000 : Length 4096
> sg[14] - Addr 0x205c000 : Length 4096
> sg[15] - Addr 0x52bd000 : Length 4096
> (da0:ahc0:0:0:0): BDR message in message buffer
> SCB 0x30 - timed out in Message-in phase, SEQADDR == 0x159
> sg[0] - Addr 0x4d6b000 : Length 3072
> panic: Disconnected List inconsistency. SCB index == 255, yet numscbs == 100.
>
> syncing disks...
>
> Fatal trap 12: page fault while in kernel mode
> fault virtual address = 0x30
> fault code = supervisor read, page not present
> instruction pointer = 0x8:0xc01e4500
> stack pointer = 0x10:0xc0285224
> frame pointer = 0x10:0xc0285228
> code segment = base 0x0, limit 0xfffff, type 0x1b
> = DPL 0, pres 1, def32 1, gran 1
> processor eflags = interrupt enabled, resume, IOPL = 0
> current process = Idle
> interrupt mask = bio cam
> trap number = 12
> panic: page fault
> Uptime: 19h12m3s
> (da0:ahc0:0:0:0): Synchronize cache failed, status == 0xb, scsi status == 0x0
>
> dumping to dev #da/0x20001, offset 761872
> dump Aborting dump due to I/O error.
> status == 0xb, scsi status == 0x0
> failed, reason: i/o error
> Automatic reboot in 15 seconds - press a key on the console to abort
> Rebooting...
>

To Unsubscribe: send mail to majordomo@FreeBSD.org
with "unsubscribe freebsd-scsi" in the body of the message