From: Achilleas Mantzios
Organization: Dynacom Tankers Mgmt
To: Manolis Kiagias
Cc: freebsd-questions@freebsd.org
Date: Mon, 18 May 2009 13:49:44 +0300
Subject: Re: Weird problem with gmirror - cannot add the Good disk when previously failed SATA disk is online

Hey Manoli!
glad to see you again,

On Monday 18 May 2009 13:27:58 Manolis Kiagias wrote:
> Achilleas Mantzios wrote:
> > Hello,
> > in advance sorry for the cross posting, it is just that freebsd-geom didn't seem that populated.
> > I run 7.1-PRERELEASE, it's a home server.
> > This morning after a power failure, the rebuild of my root gm0 failed on disk ad4.
> > The messages were:
> >
> > May 18 08:02:02 panix kernel: ad4: WARNING - WRITE_DMA UDMA ICRC error (retrying request) LBA=268091264
> > May 18 08:02:08 panix kernel: drm0: on vgapci0
> > May 18 08:02:08 panix kernel: info: [drm] AGP at 0xf0000000 128MB
> > May 18 08:02:08 panix kernel: info: [drm] Initialized i915 1.5.0 20060119
> > May 18 08:02:08 panix kernel: drm0: [ITHREAD]
> > May 18 08:02:08 panix kernel: ad4: FAILURE - device detached
> > May 18 08:02:08 panix kernel: subdisk4: detached
> > May 18 08:02:08 panix kernel: ad4: detached
> > May 18 08:02:08 panix kernel: GEOM_MIRROR: Device gm0: provider ad4 disconnected.
> > May 18 08:02:08 panix kernel: GEOM_MIRROR: Device gm0: rebuilding provider ad4 stopped.
> >
>
> It looks to me you got a bad disk now.
>

I certainly hope so, since there is nothing else I can do.

> > I read http://www.eztiger.org/2008/08/removing-and-re-adding-a-disk-in-gmirror/
> > hoping that the rebuild failure was temporary,
> > and so I tried to just run
> > # gmirror forget gm0
> > # gmirror insert gm0 ad4
> >
> > But the system responded (if I remember correctly)
> > Unknown provider ad4.
> > The system could no longer see ad4 as being online.
> >
> > So I rebooted the system many times and had these results:
> > -With ad4 offline (disconnected in hardware), the system booted ok.
> > -With both disks online the system responded consistently
> > with:
> > "GEOM_MIRROR: Cannot add disk ad6 to gm0 (error=22)."
> > Which IMO is not very ok, since gm0 should add ad6 without problem,
> > no matter if ad4 is online or not.
> > -With only ad4 online, it simply cannot find gm0 at all. (kind of reasonable)
> >
> > So my only option is to have only ad6 online, with a current gmirror status:
> > panix# gmirror status
> >       Name    Status  Components
> > mirror/gm0  COMPLETE  ad6
> >
> > Does anyone have an idea of how I should proceed (besides buying a UPS unit!)?
> > Is it meaningful to go for a new disk to replace the current ad4?
> >
>
> I'd recommend attaching the bad disk on its own to a system and perform
> tests on it. Is the BIOS recognizing this properly? I would run hardware

Yes, the BIOS recognizes it ok, I suppose.

> tests on it - either manufacturer ones, or stuff like
> sysutils/smartmontools. You could also try installing FreeBSD on it and
> see if it works. And probably use dd to clean all the contents, esp.
> the partition table and the last sector where geom information is stored.
>

Thanks. Lacking time, I think I will try to use a brand new identical disk.

> > Why is the presence of the supposedly bad disk ad4 affecting gm0,
> > when I have already told gm0 to forget about ad4?
> >
>
> The bad disk may be sending confusing signals to the bus / IDE
> interface. I've had this once (although it was due to a bad cable). The
> entire mirror would disappear suddenly.
>
>

-- 
Achilleas Mantzios