From owner-freebsd-stable@FreeBSD.ORG Wed Jul 21 06:15:13 2010 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id C54D8106564A for ; Wed, 21 Jul 2010 06:15:13 +0000 (UTC) (envelope-from boydjd@jbip.net) Received: from mail-iw0-f182.google.com (mail-iw0-f182.google.com [209.85.214.182]) by mx1.freebsd.org (Postfix) with ESMTP id 93A608FC08 for ; Wed, 21 Jul 2010 06:15:13 +0000 (UTC) Received: by iwn35 with SMTP id 35so8242091iwn.13 for ; Tue, 20 Jul 2010 23:15:12 -0700 (PDT) Received: by 10.231.32.198 with SMTP id e6mr8825627ibd.86.1279692911181; Tue, 20 Jul 2010 23:15:11 -0700 (PDT) MIME-Version: 1.0 Received: by 10.231.207.19 with HTTP; Tue, 20 Jul 2010 23:14:51 -0700 (PDT) In-Reply-To: References: <4C4504DF.30602@langille.org> <578438.38753.qm@web50502.mail.re2.yahoo.com> From: Joshua Boyd Date: Wed, 21 Jul 2010 02:14:51 -0400 Message-ID: To: alan bryan Content-Type: text/plain; charset=ISO-8859-1 X-Content-Filtered-By: Mailman/MimeDel 2.1.5 Cc: freebsd-stable , Dan Langille Subject: Re: Problems replacing failing drive in ZFS pool X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 21 Jul 2010 06:15:13 -0000 On Wed, Jul 21, 2010 at 2:09 AM, Joshua Boyd wrote: > On Wed, Jul 21, 2010 at 1:57 AM, alan bryan wrote: > >> >> >> --- On Mon, 7/19/10, Dan Langille wrote: >> >> > From: Dan Langille >> > Subject: Re: Problems replacing failing drive in ZFS pool >> > To: "Freddie Cash" >> > Cc: "freebsd-stable" >> > Date: Monday, July 19, 2010, 7:07 PM >> > On 7/19/2010 12:15 PM, Freddie Cash >> > wrote: >> > > On Mon, Jul 19, 2010 at 8:56 AM, Garrett Moore> > >> > wrote: >> > >> So you think it's because when I switch from the >> > old disk to the new disk, >> > >> ZFS doesn't realize the disk has changed, and >> > thinks the data is just >> > >> corrupt now? Even if that happens, shouldn't the >> > pool still be available, >> > >> since it's RAIDZ1 and only one disk has gone >> > away? >> > > >> > > I think it's because you pull the old drive, boot with >> > the new drive, >> > > the controller re-numbers all the devices (ie da3 is >> > now da2, da2 is >> > > now da1, da1 is now da0, da0 is now da6, etc), and ZFS >> > thinks that all >> > > the drives have changed, thus corrupting the >> > pool. I've had this >> > > happen on our storage servers a couple of times before >> > I started using >> > > glabel(8) on all our drives (dead drive on RAID >> > controller, remove >> > > drive, reboot for whatever reason, all device nodes >> > are renumbered, >> > > everything goes kablooey). >> > >> > Can you explain a bit about how you use glabel(8) in >> > conjunction with ZFS? If I can retrofit this into an >> > exist ZFS array to make things easier in the future... >> > >> > 8.0-STABLE #0: Fri Mar 5 00:46:11 EST 2010 >> > >> > ]# zpool status >> > pool: storage >> > state: ONLINE >> > scrub: none requested >> > config: >> > >> > NAME >> > STATE READ WRITE CKSUM >> > storage >> > ONLINE >> > 0 0 >> > 0 >> > raidz1 >> > ONLINE 0 >> > 0 0 >> > ad8 >> > ONLINE >> > 0 0 >> > 0 >> > ad10 >> > ONLINE 0 >> > 0 0 >> > ad12 >> > ONLINE 0 >> > 0 0 >> > ad14 >> > ONLINE 0 >> > 0 0 >> > ad16 >> > ONLINE 0 >> > 0 0 >> > >> > > Of course, always have good backups. ;) >> > >> > In my case, this ZFS array is the backup. ;) >> > >> > But I'm setting up a tape library, real soon now.... >> > >> > -- Dan Langille - http://langille.org/ >> > _______________________________________________ >> > freebsd-stable@freebsd.org >> > mailing list >> > http://lists.freebsd.org/mailman/listinfo/freebsd-stable >> > To unsubscribe, send any mail to " >> freebsd-stable-unsubscribe@freebsd.org" >> > >> >> Dan, >> >> Here's how to do it after the fact: >> >> >> http://unix.derkeiler.com/Mailing-Lists/FreeBSD/current/2009-07/msg00623.html >> >> --Alan Bryan >> > > [root@foghornleghorn ~]# glabel label disk01 /dev/da0 > glabel: Can't store metadata on /dev/da0: Operation not permitted. > > Hrmph. > Nevermind, sysctl kern.geom.debugflags=16 solves that problem, but then you get this: [root@foghornleghorn ~]# zpool replace tank da0 label/disk01 cannot open 'label/disk01': no such GEOM provider must be a full path or shorthand device name > > >> >> >> >> >> >> >> _______________________________________________ >> freebsd-stable@freebsd.org mailing list >> http://lists.freebsd.org/mailman/listinfo/freebsd-stable >> To unsubscribe, send any mail to "freebsd-stable-unsubscribe@freebsd.org" >> > > > > -- > Joshua Boyd > JBipNet > > E-mail: boydjd@jbip.net > > http://www.jbip.net > -- Joshua Boyd JBipNet E-mail: boydjd@jbip.net http://www.jbip.net