From owner-freebsd-questions@FreeBSD.ORG Mon May 18 12:39:39 2009 Return-Path: Delivered-To: freebsd-questions@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 787A51065672 for ; Mon, 18 May 2009 12:39:39 +0000 (UTC) (envelope-from freebsd-questions@m.gmane.org) Received: from ciao.gmane.org (main.gmane.org [80.91.229.2]) by mx1.freebsd.org (Postfix) with ESMTP id 015488FC17 for ; Mon, 18 May 2009 12:39:38 +0000 (UTC) (envelope-from freebsd-questions@m.gmane.org) Received: from list by ciao.gmane.org with local (Exim 4.43) id 1M627z-000422-F0 for freebsd-questions@freebsd.org; Mon, 18 May 2009 12:39:35 +0000 Received: from lara.cc.fer.hr ([161.53.72.113]) by main.gmane.org with esmtp (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Mon, 18 May 2009 12:39:35 +0000 Received: from ivoras by lara.cc.fer.hr with local (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Mon, 18 May 2009 12:39:35 +0000 X-Injected-Via-Gmane: http://gmane.org/ To: freebsd-questions@freebsd.org From: Ivan Voras Date: Mon, 18 May 2009 14:39:13 +0200 Lines: 108 Message-ID: References: <559547.57332.qm@web31605.mail.mud.yahoo.com> Mime-Version: 1.0 Content-Type: multipart/signed; micalg=pgp-sha1; protocol="application/pgp-signature"; boundary="------------enig29A127FFA9FA0D2DFC2C3B4D" X-Complaints-To: usenet@ger.gmane.org X-Gmane-NNTP-Posting-Host: lara.cc.fer.hr User-Agent: Thunderbird 2.0.0.21 (X11/20090409) In-Reply-To: <559547.57332.qm@web31605.mail.mud.yahoo.com> X-Enigmail-Version: 0.95.7 Sender: news Cc: freebsd-geom@freebsd.org Subject: Re: Weird problem with gmirror - cannot add the Good disk when previously failed SATA disk is online X-BeenThere: freebsd-questions@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: User questions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 18 May 2009 12:39:39 -0000 This is an OpenPGP/MIME signed message (RFC 2440 and 3156) --------------enig29A127FFA9FA0D2DFC2C3B4D Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable panix panix wrote: > Hello, > in advance sorry for the cross posting, it is just that freebsd-geom di= dnt seem that populated. > I run 7.1-PRERELEASE, its a home server. > today morning after a power failure, the rebuild my root gm0 failed on = disk ad4. > The messages were: >=20 > May 18 08:02:02 panix kernel: ad4: WARNING - WRITE_DMA UDMA ICRC error = (retrying request) LBA=3D268091264 > May 18 08:02:08 panix kernel: drm0: on vgapci0 > May 18 08:02:08 panix kernel: info: [drm] AGP at 0xf0000000 128MB > May 18 08:02:08 panix kernel: info: [drm] Initialized i915 1.5.0 200601= 19 > May 18 08:02:08 panix kernel: drm0: [ITHREAD] > May 18 08:02:08 panix kernel: ad4: FAILURE - device detached > May 18 08:02:08 panix kernel: subdisk4: detached > May 18 08:02:08 panix kernel: ad4: detached > May 18 08:02:08 panix kernel: GEOM_MIRROR: Device gm0: provider ad4 dis= connected. > May 18 08:02:08 panix kernel: GEOM_MIRROR: Device gm0: rebuilding provi= der ad4 stopped. >=20 > I read http://www.eztiger.org/2008/08/removing-and-re-adding-a-disk-in-= gmirror/ > hoping that the rebuld failure was temprary > and so i tried to just run=20 > # gmirror forget gm0 > # gmirror insert gm0 ad4 >=20 > But the system responded (if i remember correctly) =20 > Unknown provider ad4. > The system no longer could see ad4 being online. Yes, as you were informed by the "device detached" message - after that point the ad4 was removed from /dev. > So i rebooted the system many times and had these results: > -When having put offline ad4 (disconnected by hardware), the system boo= ted ok. > -When having both disks online the system responded consistently=20 > with: > "GEOM_MIRROR: Cannot add disk ad6 to gm0 (error=3D22)." Which means that gm0 was somehow created before - maybe from the "stale" ad4 copy? If so, you are attempting to add a newer generation of data (from ad6) to a gm0 instantiated from an older generation (from ad4). This could explain the error code (22=3Dinvalid argument). OTOH if you only have ad6 in the system this means you are trying to insert ad6 into a mirror which is already instantiated by ad6 - which is trivially wrong. > Which IMO is not very ok, since gm0 should add ad6 without problem, > no matter if ad4 is online or not. You cannot really expect the system to behave correctly with broken hardware. > -When having only ad4 online, then it simply cannot find gm0 at all. (k= ind of reasonable) Relatively. Is the ad4 recognized by the system? You didn't really clear metadata on ad4 so it should be recognized, but as a stale version (hopefully). If it isn't recognized at all, then it's broken. > So my only option is to have only ad6 online, with a current gmirror st= atus: > panix# gmirror status > Name Status Components > mirror/gm0 COMPLETE ad6 This is ok. > Anyone has an idea of how should i proceed (besides buying a UPS unit!)= > Is it meaningfull to go for a new Disk to replace current ad4? Yes. Then proceed with gmirror insert. > Why is the presence of the supposed bad disk ad4, affecting gm0, > when having already told gm0 to forget about ad4? It's relatively common (it was more common in the days of PATA cables) to have a bad drive interfering with the rest of the system. --------------enig29A127FFA9FA0D2DFC2C3B4D Content-Type: application/pgp-signature; name="signature.asc" Content-Description: OpenPGP digital signature Content-Disposition: attachment; filename="signature.asc" -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.9 (GNU/Linux) Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org iEYEARECAAYFAkoRVvcACgkQldnAQVacBcgKJwCfY4H3rkYTUE7UpTZKAXRkW0Pk ZjEAoM4CpZcQSRpNSyqv8e/8KjYBVxfU =BrYf -----END PGP SIGNATURE----- --------------enig29A127FFA9FA0D2DFC2C3B4D--