Skip site navigation (1)Skip section navigation (2)
Date:      Tue, 3 Apr 2007 16:37:51 GMT
From:      Russell Sutherland<russ@madhaus.cns.utoronto.ca>
To:        freebsd-gnats-submit@FreeBSD.org
Subject:   i386/111196: SATA drives cause errors and cause system to hang
Message-ID:  <200704031637.l33GbpQg055070@www.freebsd.org>
Resent-Message-ID: <200704031650.l33Go437089727@freefall.freebsd.org>

next in thread | raw e-mail | index | archive | help

>Number:         111196
>Category:       i386
>Synopsis:       SATA drives cause errors and cause system to hang
>Confidential:   no
>Severity:       critical
>Priority:       medium
>Responsible:    freebsd-i386
>State:          open
>Quarter:        
>Keywords:       
>Date-Required:
>Class:          sw-bug
>Submitter-Id:   current-users
>Arrival-Date:   Tue Apr 03 16:50:04 GMT 2007
>Closed-Date:
>Last-Modified:
>Originator:     Russell Sutherland
>Release:        6.2
>Organization:
University of Toronto
>Environment:
FreeBSD backup.cns.utoronto.ca 6.2-RELEASE FreeBSD 6.2-RELEASE #1: Fri Jan 19 09:53:48 EST 2007     root@backup.cns.utoronto.ca:/usr/obj/usr/src/sys/RPS  i386

>Description:
The machine shows disk errors when using one SATA drive. When I use
two SATA drives (independently or with GEOM RAID-0) the system hangs
intermittently after disk use.

Here is the existing output from dmesg:

# dmesg
Copyright (c) 1992-2007 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
        The Regents of the University of California. All rights reserved.
FreeBSD is a registered trademark of The FreeBSD Foundation.
FreeBSD 6.2-RELEASE #1: Fri Jan 19 09:53:48 EST 2007
    root@backup.cns.utoronto.ca:/usr/obj/usr/src/sys/RPS
Timecounter "i8254" frequency 1193182 Hz quality 0
CPU: Intel Pentium III (804.02-MHz 686-class CPU)
  Origin = "GenuineIntel"  Id = 0x686  Stepping = 6
  Features=0x383f9ff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,MMX,FXSR,SSE>
real memory  = 536784896 (511 MB)
avail memory = 515862528 (491 MB)
kbd1 at kbdmux0
ath_hal: 0.9.17.2 (AR5210, AR5211, AR5212, RF5111, RF5112, RF2413, RF5413)
acpi0: <ASUS CUSL2-C> on motherboard
acpi0: Power Button (fixed)
Timecounter "ACPI-fast" frequency 3579545 Hz quality 1000
acpi_timer0: <24-bit timer at 3.579545MHz> port 0xe408-0xe40b on acpi0
cpu0: <ACPI CPU> on acpi0
acpi_throttle0: <ACPI CPU Throttling> on cpu0
acpi_button0: <Power Button> on acpi0
pcib0: <ACPI Host-PCI bridge> port 0xcf8-0xcff on acpi0
pci0: <ACPI PCI bus> on pcib0
agp0: <Intel 82815 (i815 GMCH) host to PCI bridge> mem 0xf8000000-0xfbffffff at device 0.0 on pci0
pcib1: <ACPI PCI-PCI bridge> at device 1.0 on pci0
pci1: <ACPI PCI bus> on pcib1
pci1: <display, VGA> at device 0.0 (no driver attached)
pcib2: <ACPI PCI-PCI bridge> at device 30.0 on pci0
pci2: <ACPI PCI bus> on pcib2
atapci0: <SiI 3112 SATA150 controller> port 0xb800-0xb807,0xb400-0xb403,0xb000-0xb007,0xa800-0xa803,0xa400-0xa40f mem 0xf5000000-0xf50001ff irq 5 at device 10.0 on pci2
ata2: <ATA channel 0> on atapci0
ata3: <ATA channel 1> on atapci0
fxp0: <Intel 82550 Pro/100 Ethernet> port 0xa000-0xa03f mem 0xf4800000-0xf4800fff,0xf4000000-0xf401ffff irq 9 at device 11.0 on pci2
miibus0: <MII bus> on fxp0
inphy0: <i82555 10/100 media interface> on miibus0
inphy0:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
fxp0: Ethernet address: 00:02:b3:96:d1:13
ahc0: <Adaptec 2940 Ultra SCSI adapter> port 0x9800-0x98ff mem 0xf3800000-0xf3800fff irq 3 at device 12.0 on pci2
ahc0: [GIANT-LOCKED]
aic7880: Ultra Wide Channel A, SCSI Id=7, 16/253 SCBs
isab0: <PCI-ISA bridge> at device 31.0 on pci0
isa0: <ISA bus> on isab0
atapci1: <Intel ICH2 UDMA100 controller> port 0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0x8800-0x880f at device 31.1 on pci0
ata0: <ATA channel 0> on atapci1
ata1: <ATA channel 1> on atapci1
uhci0: <Intel 82801BA/BAM (ICH2) USB controller USB-A> port 0x8400-0x841f irq 12 at device 31.2 on pci0
uhci0: [GIANT-LOCKED]
usb0: <Intel 82801BA/BAM (ICH2) USB controller USB-A> on uhci0
usb0: USB revision 1.0
uhub0: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub0: 2 ports with 2 removable, self powered
pci0: <serial bus, SMBus> at device 31.3 (no driver attached)
uhci1: <Intel 82801BA/BAM (ICH2) USB controller USB-B> port 0x8000-0x801f irq 9 at device 31.4 on pci0
uhci1: [GIANT-LOCKED]
usb1: <Intel 82801BA/BAM (ICH2) USB controller USB-B> on uhci1
usb1: USB revision 1.0
uhub1: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub1: 2 ports with 2 removable, self powered
fdc0: <floppy drive controller> port 0x3f2-0x3f5,0x3f7 irq 6 drq 2 on acpi0
fdc0: [FAST]
fd0: <1440-KB 3.5" drive> on fdc0 drive 0
sio0: <16550A-compatible COM port> port 0x3f8-0x3ff irq 4 flags 0x10 on acpi0
sio0: type 16550A
atkbdc0: <Keyboard controller (i8042)> port 0x60,0x64 irq 1 on acpi0
atkbd0: <AT Keyboard> irq 1 on atkbdc0
kbd0 at atkbd0
atkbd0: [GIANT-LOCKED]
pmtimer0 on isa0
orm0: <ISA Option ROMs> at iomem 0xc0000-0xcbfff,0xcc000-0xd07ff,0xd4000-0xd57ff,0xd8000-0xd87ff on isa0
ppc0: parallel port not found.
sc0: <System console> at flags 0x100 on isa0
sc0: VGA <16 virtual consoles, flags=0x300>
sio1: configured irq 3 not in bitmap of probed irqs 0
sio1: port may not be enabled
vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
Timecounter "TSC" frequency 804023284 Hz quality 800
Timecounters tick every 1.000 msec
ad0: 32634MB <IBM DPTA-373420 P71DA32A> at ata0-master UDMA66
acd0: CDROM <CD-ROM 36X/AKW/U22> at ata1-master PIO4
ad4: 305245MB <Seagate ST3320620AS 3.AAC> at ata2-master SATA150
sa0 at ahc0 bus 0 target 6 lun 0
sa0: <QUANTUM DLT7000 2255> Removable Sequential Access SCSI-2 device 
sa0: 20.000MB/s transfers (10.000MHz, offset 8, 16bit)
Trying to mount root from ufs:/dev/ad0s1a
WARNING: / was not properly dismounted
WARNING: /tmp was not properly dismounted
WARNING: /usr was not properly dismounted
WARNING: /var was not properly dismounted
ad4: TIMEOUT - WRITE_DMA retrying (1 retry left) LBA=72259695
ad4: TIMEOUT - WRITE_DMA retrying (1 retry left) LBA=24614031
ad4: TIMEOUT - WRITE_DMA retrying (1 retry left) LBA=181025423
ad4: TIMEOUT - WRITE_DMA48 retrying (1 retry left) LBA=270973551
ad4: TIMEOUT - WRITE_DMA48 retrying (0 retries left) LBA=270973551
ad4: FAILURE - WRITE_DMA48 status=51<READY,DSC,ERROR> error=10<NID_NOT_FOUND> LBA=270973551
g_vfs_done():ad4s1a[WRITE(offset=138738417664, length=16384)]error = 5
ad4: TIMEOUT - WRITE_DMA48 retrying (1 retry left) LBA=300705359
ad4: TIMEOUT - WRITE_DMA48 retrying (0 retries left) LBA=300705359
ad4: FAILURE - WRITE_DMA48 status=51<READY,DSC,ERROR> error=10<NID_NOT_FOUND> LBA=300705359
g_vfs_done():ad4s1a[WRITE(offset=153961103360, length=16384)]error = 5
ad4: TIMEOUT - WRITE_DMA48 retrying (1 retry left) LBA=338340559
ad4: TIMEOUT - WRITE_DMA48 retrying (0 retries left) LBA=338340559
ad4: FAILURE - WRITE_DMA48 timed out LBA=338340559
g_vfs_done():ad4s1a[WRITE(offset=173230325760, length=16384)]error = 5
ad4: TIMEOUT - WRITE_DMA48 retrying (1 retry left) LBA=357550095
ad4: FAILURE - WRITE_DMA48 status=51<READY,DSC,ERROR> error=10<NID_NOT_FOUND> LBA=357550095
g_vfs_done():ad4s1a[WRITE(offset=183065608192, length=131072)]error = 5

>How-To-Repeat:
Reboot machine with two SATA drives. The SATA drive which I have removed,
in order to gain some semblance of stability is:

WD 3200KS - 00PFB0
WD Caviar  SE16
>Fix:
None.
>Release-Note:
>Audit-Trail:
>Unformatted:



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?200704031637.l33GbpQg055070>