From: Miroslav Lachman <000.fbsd@quip.cz>
Date: Wed, 10 Sep 2008 15:30:10 +0200
To: Nuno Martinho
Cc: freebsd-geom@freebsd.org
Subject: Re: GEOM_MIRROR Errors
Message-ID: <48C7CBE2.1000904@quip.cz>
In-Reply-To: <20080910102740.4b1dab5d@www.trote.org>

Nuno Martinho wrote:
>
> I have a RAID 1 that was working fine without errors reported. The RAID was not in Degraded state.
> When I shut down FreeNAS, GEOM_MIRROR starts reporting errors during shutdown.
> I’ve checked SMART and there are no problems with the disks.
>
> During shutdown:
> Sep 10 07:33:15 kernel: ad6: TIMEOUT - WRITE_DMA48 retrying (1 retry left) LBA=976773167
> Sep 10 07:33:20 kernel: ad6: TIMEOUT - WRITE_DMA48 retrying (0 retries left) LBA=976773167
> Sep 10 07:33:25 kernel: ad6: FAILURE - WRITE_DMA48 timed out LBA=976773167
> Sep 10 07:33:25 kernel: GEOM_MIRROR: Cannot write metadata on ad6 (device=raid1, error=5).
> Sep 10 07:33:25 kernel: GEOM_MIRROR: Cannot update metadata on disk ad6 (error=5).
> Sep 10 07:33:32 kernel: ad7: TIMEOUT - WRITE_DMA48 retrying (1 retry left) LBA=976773167
>
>
> During startup:
> Sep 10 07:47:47 kernel: ad6: 476940MB at ata3-master UDMA100
> Sep 10 07:47:47 kernel: ad7: 476940MB at ata3-slave UDMA100
> Sep 10 07:47:47 kernel: GEOM_MIRROR: Device raid1 created (id=3401427605).
> Sep 10 07:47:47 kernel: GEOM_MIRROR: Device raid1: provider ad7 detected.
> Sep 10 07:47:47 kernel: GEOM_MIRROR: Device raid1: provider ad6 detected.
> Sep 10 07:47:47 kernel: GEOM_MIRROR: Component ad6 (device raid1) broken, skipping.
> Sep 10 07:47:47 kernel: GEOM_MIRROR: Device raid1: provider ad7 activated.
> Sep 10 07:47:47 kernel: GEOM_MIRROR: Device raid1: provider mirror/raid1 launched.
>
> If I try to insert disk “ad6” into the mirror:
> Sep 10 07:49:40 kernel: GEOM_MIRROR: Component ad6 (device raid1) broken, skipping.
> Sep 10 07:49:40 kernel: GEOM_MIRROR: Cannot add disk ad6 to raid1 (error=22).
>
> The only solution was to rebuild the mirror.
>
> Why these errors?

This can be caused by bad cables or a bad controller. I had similar problems on an ASUS RS-120 with an Intel controller, and other people have had the same problems with Silicon Image chips.

You can re-use ad6 with:

gmirror forget raid1
  [this will not destroy your mirror data; it only removes the info about disconnected devices (ad6 in this case)]

gmirror clear ad6
  [removes the gmirror metadata from ad6]

gmirror insert raid1 ad6
  [reinserts ad6 into raid1 and starts the resync]

Miroslav Lachman
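
For anyone who wants to script the three steps above, here is a minimal sketch in Python. It only assumes the three gmirror subcommands quoted in this message (forget, clear, insert); the function names recovery_commands and run_recovery are hypothetical, made up for this illustration. By default it is a dry run that prints the commands and changes nothing.

```python
import shlex
import subprocess
from typing import List


def recovery_commands(mirror: str, disk: str) -> List[List[str]]:
    """Return the three gmirror steps from the message, in order."""
    return [
        ["gmirror", "forget", mirror],        # drop info about disconnected components
        ["gmirror", "clear", disk],           # remove the stale gmirror metadata from the disk
        ["gmirror", "insert", mirror, disk],  # re-add the disk, which starts the resync
    ]


def run_recovery(mirror: str, disk: str, dry_run: bool = True) -> List[str]:
    """Print each step; execute it only when dry_run is False (hypothetical helper)."""
    printed = []
    for cmd in recovery_commands(mirror, disk):
        line = shlex.join(cmd)
        print(line)
        printed.append(line)
        if not dry_run:
            # check=True raises on the first failing step, so later steps are skipped.
            subprocess.run(cmd, check=True)
    return printed


if __name__ == "__main__":
    # Dry run with the names from this thread: only prints the commands.
    run_recovery("raid1", "ad6")
```

Run it as a dry run first and compare the printed commands with the ones above. Passing dry_run=False requires root on the FreeBSD/FreeNAS box, and gmirror clear wipes the metadata on the named disk, so double-check the device name before executing.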