Date: Mon, 18 May 2009 02:12:16 -0700 (PDT)
From: panix panix
To: freebsd-questions@freebsd.org
Cc: freebsd-geom@freebsd.org
Subject: Weird problem with gmirror - cannot add the Good disk when previously failed SATA disk is online

Hello,

Apologies in advance for the cross-posting; freebsd-geom did not seem very active.

I run 7.1-PRERELEASE on a home server. This morning, after a power failure, the rebuild of my root mirror gm0 failed on disk ad4. The messages were:

May 18 08:02:02 panix kernel: ad4: WARNING - WRITE_DMA UDMA ICRC error (retrying request) LBA=268091264
May 18 08:02:08 panix kernel: drm0: on vgapci0
May 18 08:02:08 panix kernel: info: [drm] AGP at 0xf0000000 128MB
May 18 08:02:08 panix kernel: info: [drm] Initialized i915 1.5.0 20060119
May 18 08:02:08 panix kernel: drm0: [ITHREAD]
May 18 08:02:08 panix kernel: ad4: FAILURE - device detached
May 18 08:02:08 panix kernel: subdisk4: detached
May 18 08:02:08 panix kernel: ad4: detached
May 18 08:02:08 panix kernel: GEOM_MIRROR: Device gm0: provider ad4 disconnected.
May 18 08:02:08 panix kernel: GEOM_MIRROR: Device gm0: rebuilding provider ad4 stopped.

I read http://www.eztiger.org/2008/08/removing-and-re-adding-a-disk-in-gmirror/ hoping that the rebuild failure was temporary, so I tried running:

# gmirror forget gm0
# gmirror insert gm0 ad4

But the system responded (if I remember correctly) with "Unknown provider ad4". The system could no longer see ad4 as online. So I rebooted the system several times, with these results:

- With ad4 offline (physically disconnected), the system booted OK.
- With both disks online, the system consistently responded with: "GEOM_MIRROR: Cannot add disk ad6 to gm0 (error=22)." IMO that is not right, since gm0 should add ad6 without problems whether ad4 is online or not.
- With only ad4 online, the system simply cannot find gm0 at all (which is kind of reasonable).

So my only option is to keep only ad6 online, with the current gmirror status:

panix# gmirror status
      Name    Status  Components
mirror/gm0  COMPLETE  ad6

Does anyone have an idea of how I should proceed (besides buying a UPS unit!)? Is it worthwhile to buy a new disk to replace the current ad4? Why does the presence of the supposedly bad disk ad4 affect gm0 when I have already told gm0 to forget about ad4?

Thanks for any suggestions.
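
For anyone following along, the single-disk state shown in the status output above can be checked from a script. What follows is only a minimal sketch: it assumes `gmirror status` prints the three columns shown above (Name, Status, Components), with any further components of the same mirror on continuation lines. The helper name `mirror_summary` is hypothetical and not part of gmirror.

```shell
#!/bin/sh
# Hypothetical helper (not part of gmirror): read `gmirror status`
# output on stdin and print "name status component-count" per mirror.
# Assumes the Name / Status / Components layout shown in this post.
mirror_summary() {
    awk '
        $1 == "Name" { next }                 # skip the header row
        $1 ~ /^mirror\// {                    # a new mirror entry starts here
            if (name != "") print name, status, count
            name = $1; status = $2
            count = (NF >= 3) ? 1 : 0         # first component sits on the same line
            next
        }
        NF > 0 && name != "" { count++ }      # continuation line: one more component
        END { if (name != "") print name, status, count }
    '
}
```

On a live system it could be used as `gmirror status | mirror_summary`. For the output above it would print `mirror/gm0 COMPLETE 1`, that is, a mirror reported as COMPLETE but with only one component listed.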