From owner-freebsd-stable@FreeBSD.ORG Sun Dec 21 08:28:47 2003 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id C1C6616A4CE for ; Sun, 21 Dec 2003 08:28:47 -0800 (PST) Received: from MXR-2.estpak.ee (ld1.estpak.ee [194.126.101.98]) by mx1.FreeBSD.org (Postfix) with ESMTP id E93B543D5E for ; Sun, 21 Dec 2003 08:28:44 -0800 (PST) (envelope-from kalts@estpak.ee) Received: from localhost (reha2 [127.0.0.1]) by MXR-2.estpak.ee (Postfix) with ESMTP id 4416128A3A; Sun, 21 Dec 2003 18:28:56 +0200 (EET) Received: from MXR-2.estpak.ee ([127.0.0.1]) by localhost (reha2 [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 24653-03; Sun, 21 Dec 2003 18:28:56 +0200 (EET) Received: from kevad.internal (80-235-44-163-dsl.mus.estpak.ee [80.235.44.163]) by MXR-2.estpak.ee (Postfix) with ESMTP id 84B55289BD; Sun, 21 Dec 2003 18:28:55 +0200 (EET) Received: from kevad.internal (localhost [127.0.0.1]) by kevad.internal (8.12.10/8.12.10) with ESMTP id hBLGSgvS002084; Sun, 21 Dec 2003 18:28:42 +0200 (EET) (envelope-from vallo@kevad.internal) Received: (from vallo@localhost) by kevad.internal (8.12.10/8.12.10/Submit) id hBLGSfE0002083; Sun, 21 Dec 2003 18:28:41 +0200 (EET) (envelope-from vallo) Date: Sun, 21 Dec 2003 18:28:41 +0200 From: Vallo Kallaste To: "Oivind H. Danielsen" Message-ID: <20031221162841.GA1533@kevad.internal> References: Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.5.4i-ja.1 X-Virus-Scanned: by amavisd-new at neti.ee cc: freebsd-stable@freebsd.org Subject: Re: WRITE command timeout X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.1 Precedence: list Reply-To: kalts@estpak.ee List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 21 Dec 2003 16:28:47 -0000 On Sat, Dec 20, 2003 at 07:07:41PM +0100, "Oivind H. Danielsen" wrote: > We have been running FreeBSD 4.6-5.1 systems for 1.5 years and are being > plagued by these: > > Dec 18 15:15:39 <> /kernel: ad0: WRITE command timeout tag=0 serv=0 - > resetting > Dec 19 15:03:23 <> /kernel: ad0: READ command timeout tag=0 serv=0 - > resetting > In our rack we have 34 identical drives (IBM IC35L080AVVA07). > > 24 drives on Windows 2000 : no problems. > 4 drives on Linux 2.4.x : no problems. > > 2 drives on RELENG_4_8 > (VIA 82C686, VIA C3) : no problems > > 4 drives on RELENG_4_8 > (nVIDIA nForce, XP 2000+) : r/w timeouts, fs corruption. > > (1 drive/system, 6 FreeBSD boxes) > > The good systems have been running the 1.5 years without a hitch. The > four identical RELENG_4_8 systems have all had corrupted filesystems (at > least once every two months). You seem to like fighting with your FreeBSD boxes, 1,5 years is a lot of time in terms of FreeBSD releases.. Otherwise I would suggest running Linux and be done with it. I'm far from being Linux advocate and have no Linux systems at the moment, but sometimes you must decide what you value.. Have you tried to move the seemingly failing disks from the FreeBSD boxes to working ones? Simply swap the disks, as you have cluster it should be simple. Perhaps swap cables, too, to have less variables in comparison. -- Vallo Kallaste