From owner-freebsd-stable@FreeBSD.ORG Thu Jun 24 17:52:32 2010 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 7098F106564A for ; Thu, 24 Jun 2010 17:52:32 +0000 (UTC) (envelope-from matt@bubblegen.co.uk) Received: from relay.pcl-ipout01.plus.net (relay.pcl-ipout01.plus.net [212.159.7.99]) by mx1.freebsd.org (Postfix) with ESMTP id 0E0DE8FC13 for ; Thu, 24 Jun 2010 17:52:31 +0000 (UTC) X-IronPort-Anti-Spam-Filtered: true X-IronPort-Anti-Spam-Result: Av0EAJY4I0xUXebj/2dsb2JhbACDHZwacbB+kTaBKYE7gU1wBA Received: from outmx01.plus.net ([84.93.230.227]) by relay.pcl-ipout01.plus.net with ESMTP; 24 Jun 2010 18:52:30 +0100 Received: from bubblegen.plus.com ([80.229.236.194] helo=[192.136.1.18]) by outmx01.plus.net with esmtp (Exim) id 1ORqbG-0006Ft-0X; Thu, 24 Jun 2010 18:52:30 +0100 From: Matthew Lear To: Bob Bishop In-Reply-To: <82A96ECD-676C-4A4D-A328-0CFAABD64D50@gid.co.uk> References: <1276844904.7519.19.camel@almscliff.bubblegen.co.uk> <20100618082127.GA34578@icarus.home.lan> <1276876031.7519.39.camel@almscliff.bubblegen.co.uk> <20100618174208.GA47470@icarus.home.lan> <1276889330.2210.44.camel@almscliff.bubblegen.co.uk> <1277155992.1860.3.camel@almscliff.bubblegen.co.uk> <20100622074541.GA71157@icarus.home.lan> <82A96ECD-676C-4A4D-A328-0CFAABD64D50@gid.co.uk> Content-Type: text/plain; charset="UTF-8" Date: Thu, 24 Jun 2010 18:52:14 +0100 Message-ID: <1277401934.1874.12.camel@almscliff.bubblegen.co.uk> Mime-Version: 1.0 X-Mailer: Evolution 2.28.3 Content-Transfer-Encoding: 7bit Cc: freebsd-stable@freebsd.org, Jeremy Chadwick Subject: Re: 7.2-RELEASE-p4, IO errors & RAID1 failure X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 24 Jun 2010 17:52:32 -0000 On Tue, 2010-06-22 at 20:04 +0100, Bob Bishop wrote: > Hi, > > On 22 Jun 2010, at 08:45, Jeremy Chadwick wrote: > > > On Mon, Jun 21, 2010 at 10:33:12PM +0100, Matthew Lear wrote: > >> [tale of woe elided] > > > > I don't really have any other thoughts on the matter, sadly. > > [helpful suggestions elided] > > > > Anyone else have ideas/recommendations? > > The disks sure look OK. I wouldn't rule out the controller(s), I've had various chipsets fail in odd ways. > Thanks Bob. I think we all thought the same. I've actually just rebooted the machine and FreeBSD no longer boots. This isn't what I was expecting at all. Something has clearly gone wrong with some file system metadata. When I commissioned the machine I installed an 'early' bootloader (apologies for perhaps using an incorrect term) which boots FreeBSD by default (F1 option) or from Drive 1 (F5). Drive 1 is the DVD drive. It appears to be the case that the early bootloader tries to boot FreeBSD and fails. I get the messages: error 1 lba 795079 Invalid format FreeBSD/i386 boot Default: 0:ad(0,a)/boot/kernel/kernel boot: error 1 lba 786815 No /boot/kernel/kernel FreeBSD/i386 boot Default: 0:ad(0,a)/boot/kernel/kernel boot: ...and I'm at a boot prompt. So, given that ad0 was the failed disk, the bootloader has failed to find specific boot data on ad0 and dropped me into a boot prompt. I'm tempted to replace the boot line with 0:ad(2,a)/boot/kernel/kernel or should that be 2:ad(0,a)/boot/kernel/kernel but I'm a little suspicious of doing anything at this point? Can anybody offer any guidance of what I can do to restore my system? I was able to shut down the machine cleanly (shutdown -p now) and despite the RAID mirror going offline, everything seemed to be behaving normally (expected I guess given that I just lost some redundancy). I'm just that little bit more worried now :-( If the disks are ok, what on earth could have happened and more importantly, how can I restore what was an operational system when I shut it down?! Puzzled.... -- Matt