From owner-freebsd-stable@FreeBSD.ORG Sun Dec 21 20:25:34 2003 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 98E8B16A4CE for ; Sun, 21 Dec 2003 20:25:34 -0800 (PST) Received: from lcremeans.homeip.net (dsl092-160-012.wdc2.dsl.speakeasy.net [66.92.160.12]) by mx1.FreeBSD.org (Postfix) with ESMTP id 89D3043D5C for ; Sun, 21 Dec 2003 20:25:30 -0800 (PST) (envelope-from lee@lcremeans.homeip.net) Received: from lcremeans.homeip.net (lee.local [192.168.0.252]) by lcremeans.homeip.net (8.12.9/8.12.9) with ESMTP id hBM4ODMB096479; Sun, 21 Dec 2003 23:24:13 -0500 (EST) (envelope-from lee@lcremeans.homeip.net) Message-ID: <3FE67233.7050701@lcremeans.homeip.net> Date: Sun, 21 Dec 2003 23:25:23 -0500 From: Lee Cremeans User-Agent: Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.6b) Gecko/20031208 X-Accept-Language: en-us, en MIME-Version: 1.0 To: "Oivind H. Danielsen" References: In-Reply-To: Content-Type: text/plain; charset=us-ascii; format=flowed Content-Transfer-Encoding: 7bit cc: freebsd-stable@freebsd.org Subject: Re: WRITE command timeout X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.1 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 22 Dec 2003 04:25:34 -0000 Oivind H. Danielsen wrote: > Hello. > > We have been running FreeBSD 4.6-5.1 systems for 1.5 years and are being > plagued by these: > > Dec 18 15:15:39 <> /kernel: ad0: WRITE command timeout tag=0 serv=0 - > resetting > Dec 19 15:03:23 <> /kernel: ad0: READ command timeout tag=0 serv=0 - > resetting > > > In our rack we have 34 identical drives (IBM IC35L080AVVA07). > > 24 drives on Windows 2000 : no problems. > 4 drives on Linux 2.4.x : no problems. > > 2 drives on RELENG_4_8 > (VIA 82C686, VIA C3) : no problems > > 4 drives on RELENG_4_8 > (nVIDIA nForce, XP 2000+) : r/w timeouts, fs corruption. > > (1 drive/system, 6 FreeBSD boxes) > > The good systems have been running the 1.5 years without a hitch. The > four identical RELENG_4_8 systems have all had corrupted filesystems (at > least once every two months). > > > We have tried the following: > > - Changed ATA100 cables (3 diff. types, all 80-wire) > - Disabled DMA (use PIO4) (hw.ata.ata_dma="0" in loader.conf) > - Disabled DMA in BIOS setup > - Changed motherboard (MSI MS6734, VIA KM400, vt8235 ATA) > - Changed power supply (added 100W) > - RELENG_5_1. > > None of these changes has helped. The only change seen when disabling > DMA is additional messages: "timeout waiting for DRQ - resetting". It sounds like that particular drive is on the way out. Have you tried running IBM/HGST's "Drive Fitness Tools" disk on it? (It's a DOS program, but it comes on a self-booting diskette image that you can also burn to a bootable CD if you like.) That program should be able to detect any problems, and let you know if you need to send the drive back. -lee