From owner-freebsd-questions@FreeBSD.ORG Mon Feb 28 19:27:45 2005 Return-Path: Delivered-To: freebsd-questions@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 3833D16A4CE for ; Mon, 28 Feb 2005 19:27:45 +0000 (GMT) Received: from smtp11.wanadoo.fr (smtp11.wanadoo.fr [193.252.22.31]) by mx1.FreeBSD.org (Postfix) with ESMTP id AFD4143D5C for ; Mon, 28 Feb 2005 19:27:44 +0000 (GMT) (envelope-from atkielski.anthony@wanadoo.fr) Received: from me-wanadoo.net (localhost [127.0.0.1]) by mwinf1102.wanadoo.fr (SMTP Server) with ESMTP id E4AAB1C0009C for ; Mon, 28 Feb 2005 20:27:43 +0100 (CET) Received: from pix.atkielski.com (ASt-Lambert-111-2-1-3.w81-50.abo.wanadoo.fr [81.50.80.3]) by mwinf1102.wanadoo.fr (SMTP Server) with ESMTP id BFB481C00097 for ; Mon, 28 Feb 2005 20:27:43 +0100 (CET) X-ME-UUID: 20050228192743785.BFB481C00097@mwinf1102.wanadoo.fr Date: Mon, 28 Feb 2005 20:27:43 +0100 From: Anthony Atkielski X-Priority: 3 (Normal) Message-ID: <886633821.20050228202743@wanadoo.fr> To: freebsd-questions@freebsd.org In-Reply-To: References: <1561762673.20050227155330@wanadoo.fr> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 7bit Subject: Re: WRITE_DMA errors on SATA drive under 5.3-RELEASE X-BeenThere: freebsd-questions@freebsd.org X-Mailman-Version: 2.1.1 Precedence: list Reply-To: freebsd-questions@freebsd.org List-Id: User questions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 28 Feb 2005 19:27:45 -0000 Garance A Drosihn writes: > First question: which SATA controller are you using? The controller is built into the Asus P4P800-E motherboard, and is based on the Intel ICH5R southbridge chipset. There's also a Promise 20378 RAID controller on board but I do NOT use it (disabled in BIOS). > And what is the make&model of the hard drives that you are using? The SATA drives are two identical Western Digital WD1200JD 120-GB drives, 7200 RPM. Device ad10 holds /tmp and /var; device ad12 holds /usr. There is also a third drive, an older Samsung SV4002H (40 GB), connected to the primary IDE controller. This drive holds the root /. Although the error messages I've seen name ad10 (the first SATA drive), smartctl says that no errors have occurred on either of these drives--whereas it does show a log of errors on the third drive (ad0) that seem to correspond mysterious to the errors in the message. > Note: There have been several different threads on different mailing > lists from users having WRITE_DMA errors similar to this. At least > some of the problem is in the code which handles disk I/O. So I've surmised. The problem seems to be quite rare, but since this is a production server I worry about disk writes not being completed; I have no easy way to tell whether writes were actually lost or not. > I realize that none of that info really helps you right now, but > I just thought I would say that it may be you're not having any > hardware problems. Or at least, not on the disk itself. It might > be a problem with the disk-controller, or it might be fairly minor > timing-problems that come up under certain kinds of load. I don't think there are any hardware problems at all. This isn't a terribly exotic configuration. It's probably a bug or configuration problem. -- Anthony