From owner-freebsd-stable@FreeBSD.ORG Tue Dec 14 02:35:00 2004 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 7917916A4CE for ; Tue, 14 Dec 2004 02:35:00 +0000 (GMT) Received: from carver.gumbysoft.com (carver.gumbysoft.com [66.220.23.50]) by mx1.FreeBSD.org (Postfix) with ESMTP id 631FE43D2F for ; Tue, 14 Dec 2004 02:35:00 +0000 (GMT) (envelope-from dwhite@gumbysoft.com) Received: by carver.gumbysoft.com (Postfix, from userid 1000) id 5666372DD4; Mon, 13 Dec 2004 18:35:00 -0800 (PST) Received: from localhost (localhost [127.0.0.1]) by carver.gumbysoft.com (Postfix) with ESMTP id 5181A72DCB; Mon, 13 Dec 2004 18:35:00 -0800 (PST) Date: Mon, 13 Dec 2004 18:35:00 -0800 (PST) From: Doug White To: Joe Rhett In-Reply-To: <20041213192119.GB4781@meer.net> Message-ID: <20041213183336.T97507@carver.gumbysoft.com> References: <20041213052628.GB78120@meer.net> <20041213054159.GC78120@meer.net><20041213060549.GE78120@meer.net> <20041213192119.GB4781@meer.net> MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII cc: freebsd-stable@freebsd.org cc: =?iso-8859-1?Q?S=F8ren?= Schmidt Subject: Re: drive failure during rebuild causes page fault X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.1 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 14 Dec 2004 02:35:00 -0000 On Mon, 13 Dec 2004, Joe Rhett wrote: > > This is why I don't trust ATA RAID for fault tolerance -- it'll save your > > data, but the system will tank. Since the disk state is maintained by > > the OS and not abstracted by a separate processor, if a disk dies in a > > particularly bad way the system may not be able to cope. > > Yes, but SATA isn't limited by this problem. It does have a processor per > disk. (this is all SATA, if I didn't make that clear) Actually on SATA its worse -- the disk just stops responding to everything and hangs. If you don't detect this condition then you go into an infinite wait. In any case, yes the ATA RAID code could use a massive robustness pass. So could the core ATA code. Patches accepted :) -- Doug White | FreeBSD: The Power to Serve dwhite@gumbysoft.com | www.FreeBSD.org