From owner-freebsd-current@FreeBSD.ORG Wed Nov 26 10:15:10 2003 Return-Path: Delivered-To: freebsd-current@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id DFF0F16A4E2 for ; Wed, 26 Nov 2003 10:15:09 -0800 (PST) Received: from ms-smtp-02-eri0.southeast.rr.com (ms-smtp-02-lbl.southeast.rr.com [24.25.9.101]) by mx1.FreeBSD.org (Postfix) with ESMTP id 74F8943F85 for ; Wed, 26 Nov 2003 10:15:08 -0800 (PST) (envelope-from marcus@marcuscom.com) Received: from creme-brulee.marcuscom.com (rdu74-159-108.nc.rr.com [24.74.159.108])hAQIF5Pf016969 for ; Wed, 26 Nov 2003 13:15:06 -0500 (EST) Received: from [10.2.1.4] (vpn-client-4.marcuscom.com [10.2.1.4]) hAQIE5uY002993 for ; Wed, 26 Nov 2003 13:14:05 -0500 (EST) (envelope-from marcus@marcuscom.com) From: Joe Marcus Clarke To: current@freebsd.org Content-Type: multipart/signed; micalg=pgp-sha1; protocol="application/pgp-signature"; boundary="=-8ZneiDb9kC82vYZA7i3w" Organization: MarcusCom, Inc. Message-Id: <1069870507.752.18.camel@gyros> Mime-Version: 1.0 X-Mailer: Ximian Evolution 1.4.5 Date: Wed, 26 Nov 2003 13:15:07 -0500 X-Spam-Status: No, hits=-4.9 required=5.0 tests=BAYES_00 autolearn=ham version=2.60 X-Spam-Checker-Version: SpamAssassin 2.60 (1.212-2003-09-23-exp) on creme-brulee.marcuscom.com X-Virus-Scanned: Symantec AntiVirus Scan Engine Subject: Recent ATA drivers giving problems with SATA X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.1 Precedence: list List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 26 Nov 2003 18:15:10 -0000 --=-8ZneiDb9kC82vYZA7i3w Content-Type: text/plain Content-Transfer-Encoding: quoted-printable About a month ago, I bought a new SATA controller and a 160 GB Seagate SATA drive for my -CURRENT machine. All was working fine until about a week ago. Then, the drive started experiencing hard, unrecoverable DMA errors. I RMA'd the drive, then bought a new Maxtor 80 GB SATA drive (just yesterday). I started a buildworld on this drive, and it religiously fails about half-way through all the time (never at exactly the same place twice, however). The kernels I had when the failures occurred were: FreeBSD fugu.marcuscom.com 5.2-BETA FreeBSD 5.2-BETA #0: Mon Nov 24 23:14:49 EST 2003 =20 gnome@fugu.marcuscom.com:/space/obj/usr/src/sys/FUGU i386 FreeBSD fugu.marcuscom.com 5.1-CURRENT FreeBSD 5.1-CURRENT #0: Mon Nov 17 21:23:07 EST 2003 =20 gnome@fugu.marcuscom.com:/space/obj/usr/src/sys/FUGU i386 Kernels before that did not experience the problem. The buildworld fails with an Input/Output error, then I see the following on the console: Nov 26 02:35:12 fugu kernel: ad4: WARNING - WRITE_DMA recovered from missing interrupt Nov 26 02:35:12 fugu kernel: ad4: FAILURE - WRITE_DMA status=3Dff error=3D0 Nov 26 02:35:22 fugu kernel: ad4: WARNING - READ_DMA recovered from missing interrupt Nov 26 02:35:22 fugu kernel: ad4: FAILURE - READ_DMA status=3Dff error=3D0 ... Nov 26 04:37:24 fugu kernel: ad4: timeout sending command=3Dca Nov 26 04:37:24 fugu kernel: ad4: error issuing DMA command At this point, the machine is unusable, and the above two lines scroll by continuously until the machine is rebooted. Here are the dmesg specifics for the controller and drive: atapci1: port 0x14b0-0x14bf,0x14c0-0x14c3,0x14c8-0x14cf,0x14c4-0x14c7,0x14d0-0x14d7 mem 0xe800a000-0xe800a1ff irq 9 at device 16.0 on pci0 GEOM: create disk ad4 dp=3D0xc5246460 ad4: 78167MB [158816/16/63] at ata2-master UDMA133 Nothing else was changed in the machine except the specific version of -CURRENT since the time things worked and now. In addition to replacing the drive, I have replaced the SATA cable as well. My plan is to revert the ATA drivers to two weeks ago, and see if the problem persists.=20 Failing that, I will test to see if this is a cooling problem. Failing that, I will replace the SATA controller. However, I wanted to know if I'm barking up the wrong tree, and perhaps this is a software issue.=20 Thanks. Joe --=20 PGP Key : http://www.marcuscom.com/pgp.asc --=-8ZneiDb9kC82vYZA7i3w Content-Type: application/pgp-signature; name=signature.asc Content-Description: This is a digitally signed message part -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.2.3 (FreeBSD) iD8DBQA/xO2rb2iPiv4Uz4cRAoJiAJ9NBmWVu65Do1rMQSwyktj6/lBzLACggiiE TpSV+3ECFWEG2ydnmu7wxuE= =BHzw -----END PGP SIGNATURE----- --=-8ZneiDb9kC82vYZA7i3w--