From owner-freebsd-questions Sun Feb 16 10:10:50 2003 Delivered-To: freebsd-questions@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 0D3BE37B401 for ; Sun, 16 Feb 2003 10:10:48 -0800 (PST) Received: from netmail02.services.quay.plus.net (netmail02.services.quay.plus.net [212.159.14.221]) by mx1.FreeBSD.org (Postfix) with SMTP id 8F13343F3F for ; Sun, 16 Feb 2003 10:10:46 -0800 (PST) (envelope-from trent@limekiln.vcisp.net) Received: (qmail 27915 invoked from network); 16 Feb 2003 18:10:45 -0000 Received: from limekiln.vcisp.net (212.159.16.110) by netmail02.services.quay.plus.net with SMTP; 16 Feb 2003 18:10:45 -0000 Received: by limekiln.vcisp.net (Postfix, from userid 1001) id 674C94B6; Sun, 16 Feb 2003 18:10:38 +0000 (GMT) Date: Sun, 16 Feb 2003 18:10:38 +0000 From: Trent Nelson To: questions@FreeBSD.org Subject: Worrying ATA problems. Message-ID: <20030216181038.GA33183@limekiln.vcisp.net> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline User-Agent: Mutt/1.5.1i Sender: owner-freebsd-questions@FreeBSD.ORG Precedence: bulk List-ID: List-Archive: (Web Archive) List-Help: (List Instructions) List-Subscribe: List-Unsubscribe: X-Loop: FreeBSD.ORG Hi, I've had a new server for a little over three months now that's started running into some worrying ata troubles. The server is an Athlon XP 1800+, ASUS A7V8X M/B, 1GB PC2700 DDR RAM, 2x 60GB Maxtor (6Y060L0) HDDs. I've also purchased a ``Sil 0680 ATA133 controller'' that provides master ata channels for a spare 30GB drive and my CD-RW. It's running 4.7-STABLE as of a few days The error messages and relevant dmesg output follows. What do these errors normally indicate? Dodgy hardware? The drives are pretty much brand new (< 3 months), as are all the other components. I don't think it's a cooling problem, either. I've got a 120mm fan blowing cool air onto the drives, and each drive is fitted with an individual cooling unit (two small fans on the controller-side of the drive). The air is never more than appro- ximately 20 degrees Celsius in that part of the box. Other random tidbits; the server runs VMware and fvcool, as well as a few other standard daemons, but the box is nearly always idle. Thanks in advance. Regards, Trent. ad0: WRITE command timeout tag=0 serv=0 - resetting ata0: resetting devices .. done ad0s1g: soft error (ECC corrected) reading fsbn 3158527 of 6368-6399 (ad0s1 bn 3158527; cn 196 tn 155 sn 22)ad0s1g: hard error reading fsbn 3158527 of 6368-6399 (ad0s1 bn 3158527; cn 196 tn 155 sn 22) trying PIO mode ad0: READ command timeout tag=0 serv=0 - resetting ata0: resetting devices .. done ad0: DMA problem fallback to PIO mode ad0: DMA problem fallback to PIO mode ad0s1a: soft error (ECC corrected) reading fsbn 24231 of 12076-12095 (ad0s1 bn 24231; cn 1 tn 129 sn 39)ad0s1a: hard error reading f sbn 24231 of 12076-12095 (ad0s1 bn 24231; cn 1 tn 129 sn 39) status=7f error=7f ad0: timeout waiting for DRQ - resetting ata0: resetting devices .. ata0-slave: ATA identify retries exceeded done ad0: timeout waiting for DRQ - resetting ata0: resetting devices .. done ad0s1g: soft error (ECC corrected) reading fsbn 3158527 of 6368-6399 (ad0s1 bn 3158527; cn 196 tn 155 sn 22)ad0s1g: hard error reading fsbn 3158527 of 6368-6399 (ad0s1 bn 3158527; cn 196 tn 155 sn 22) status=7f error=7f ad0: READ command timeout tag=0 serv=0 - resetting ata0: resetting devices .. done ad0: READ command timeout tag=0 serv=0 - resetting ata0: resetting devices .. done ad0s1d: soft error (ECC corrected) reading fsbn 72749503 of 21170368-21170391 (ad0s1 bn 72749503; cn 4528 tn 114 sn 1)ad0s1d: hard e rror reading fsbn 72749503 of 21170368-21170391 (ad0s1 bn 72749503; cn 4528 tn 114 sn 1) status=7f error=7f spec_getpages:(#ad/0x20003) I/O read failure: (error=5) bp 0xcfeb9390 vp 0xe06f5b40 size: 12288, resid: 12288, a_count: 12284, valid: 0x0 nread: 0, reqpage: 0, pindex: 0, pcount: 3 ad0: READ command timeout tag=0 serv=0 - resetting ata0: resetting devices .. done Relevant dmesg output: FreeBSD 4.7-STABLE #8: Sat Feb 15 17:44:48 GMT 2003 root@zoom.limekiln.vcisp.net:/obj/shared/work/BSD/FreeBSD/FreeBSD-4.x/src/sys/ZOOM Timecounter "i8254" frequency 1193182 Hz CPU: AMD Athlon(TM) XP 1800+ (1533.40-MHz 686-class CPU) real memory = 1073725440 (1048560K bytes) avail memory = 1041625088 (1017212K bytes) atapci0: port 0x9800-0x980f,0xa000-0xa003,0xa400-0xa407,0xa800-0xa803,0xb000-0xb007 mem 0xeb000000-0xeb 0000ff irq 5 at device 12.0 on pci0 ata2: at 0xb000 on atapci0 ata3: at 0xa400 on atapci0 atapci1: port 0x8400-0x840f at device 17.1 on pci0 ata0: at 0x1f0 irq 14 on atapci1 ata1: at 0x170 irq 15 on atapci1 ad0: 58644MB [119150/16/63] at ata0-master UDMA133 ad2: 58644MB [119150/16/63] at ata1-master UDMA133 ad4: 28629MB [58168/16/63] at ata2-master UDMA100 acd0: CD-RW at ata3-master UDMA33 To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-questions" in the body of the message