From owner-freebsd-stable@FreeBSD.ORG Wed Jan 19 21:33:15 2005 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 8F78416A4CE for ; Wed, 19 Jan 2005 21:33:15 +0000 (GMT) Received: from wproxy.gmail.com (wproxy.gmail.com [64.233.184.207]) by mx1.FreeBSD.org (Postfix) with ESMTP id 0E11F43D39 for ; Wed, 19 Jan 2005 21:33:15 +0000 (GMT) (envelope-from jsimola@gmail.com) Received: by wproxy.gmail.com with SMTP id 67so10054wri for ; Wed, 19 Jan 2005 13:33:12 -0800 (PST) DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=beta; d=gmail.com; h=received:message-id:date:from:reply-to:to:subject:in-reply-to:mime-version:content-type:content-transfer-encoding:references; b=mkoSJf/61ltksaiexfQ5/WmcdajQRPHnUB0h2R4uyToeIzJPRdu1GsV81gXr7zNSnI15hNbzBrv2PO3etQ90CtmNMlEKPMJ3TZmjYeqCRwCcD9L4agEpnm8+i3/UOlY9UEDwgy0HJTNZpCc3ci8FvQ2TCA23Mqlkqj2lk+t/xxY= Received: by 10.54.54.39 with SMTP id c39mr200265wra; Wed, 19 Jan 2005 13:33:12 -0800 (PST) Received: by 10.54.39.34 with HTTP; Wed, 19 Jan 2005 13:33:12 -0800 (PST) Message-ID: <8eea040805011913334b140af6@mail.gmail.com> Date: Wed, 19 Jan 2005 13:33:12 -0800 From: Jon Simola To: Karl Denninger , freebsd-stable@freebsd.org In-Reply-To: <20050119151301.A22310@Denninger.Net> Mime-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit References: <20050119151301.A22310@Denninger.Net> Subject: Re: Bad disk or kernel (ATA Driver) problem? X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.1 Precedence: list Reply-To: jon@abccomm.com List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 19 Jan 2005 21:33:15 -0000 On Wed, 19 Jan 2005 15:13:01 -0600, Karl Denninger wrote: > I've got a 2 x SATA system here I'm playing with in preparation to move > over production to 5.x. > > These drives have been working under 4.x for quite some time - they're 250GB > Maxtor disks.... > > ad4: 239372MB [486344/16/63] at ata2-master SATA150 > ad6: 239372MB [486344/16/63] at ata3-master SATA150 > > The first disk runs nice and happy. > > The second does too, provided that the load isn't too high. If it is, then I > start to get DMA transfer errors, such as the following: > > ad6: FAILURE - READ_DMA status=51 error=40 > LBA=543191 > GEOM_MIRROR: Request failed (error=5). ad6[READ(offset=278048256, > length=102400)] > GEOM_MIRROR: Device m0: provider ad6 disconnected. > ad6: FAILURE - READ_DMA status=51 error=40 > LBA=300463 > ad6: TIMEOUT - READ_DMA retrying (2 retries left) LBA=90863 > ad6: FAILURE - READ_DMA timed out > ad6: FAILURE - READ_DMA status=51 error=40 > ad6: TIMEOUT - READ_DMA retrying (2 retries left) LBA=120663 > ad6: FAILURE - READ_DMA timed out > > I'm having a lot of trouble believing this is an actual disk problem. Among > other things, its happening at different places - not always at the same > block. I've got a few 1U Supermicro boxes running dual SATA drives: ad4: 78167MB [158816/16/63] at ata2-master SATA150 ad6: 78167MB [158816/16/63] at ata3-master SATA150 I've run into all sorts of problems with every one, and changing the IDE channel settings in the BIOS always fixes it. Which really annoys me, because I setup a new box, run it for a couple weeks, then the drives start getting flaky under load. Then I go change the setting in the BIOS (that I always forget to do on initial setup) and it's dead stable for months at a time. I've had the exact same problem with FreeBSD 5.3 and OpenBSD 3.5 as well.