From owner-freebsd-fs@FreeBSD.ORG Mon Jun 22 01:49:49 2015 Return-Path: Delivered-To: freebsd-fs@nevdull.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by hub.freebsd.org (Postfix) with ESMTPS id DF46D22E for ; Mon, 22 Jun 2015 01:49:49 +0000 (UTC) (envelope-from michelle@sorbs.net) Received: from hub.freebsd.org (hub.freebsd.org [IPv6:2001:1900:2254:206c::16:88]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (Client CN "hub.freebsd.org", Issuer "hub.freebsd.org" (not verified)) by mx1.freebsd.org (Postfix) with ESMTPS id C4D18A2F for ; Mon, 22 Jun 2015 01:49:49 +0000 (UTC) (envelope-from michelle@sorbs.net) Received: by hub.freebsd.org (Postfix) id BAAE122D; Mon, 22 Jun 2015 01:49:49 +0000 (UTC) Delivered-To: fs@nevdull.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [8.8.178.115]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by hub.freebsd.org (Postfix) with ESMTPS id B8E0B22C for ; Mon, 22 Jun 2015 01:49:49 +0000 (UTC) (envelope-from michelle@sorbs.net) Received: from hades.sorbs.net (hades.sorbs.net [67.231.146.201]) by mx1.freebsd.org (Postfix) with ESMTP id A6D6DA2E for ; Mon, 22 Jun 2015 01:49:49 +0000 (UTC) (envelope-from michelle@sorbs.net) MIME-version: 1.0 Content-transfer-encoding: 7BIT Content-type: text/plain; CHARSET=US-ASCII Received: from isux.com (firewall.isux.com [213.165.190.213]) by hades.sorbs.net (Oracle Communications Messaging Server 7.0.5.29.0 64bit (built Jul 9 2013)) with ESMTPSA id <0NQB00IYKPCDW900@hades.sorbs.net> for fs@freebsd.org; Sun, 21 Jun 2015 18:55:27 -0700 (PDT) Message-id: <558769B5.601@sorbs.net> Date: Mon, 22 Jun 2015 03:49:41 +0200 From: Michelle Sullivan User-Agent: Mozilla/5.0 (Macintosh; U; Intel Mac OS X; en-US; rv:1.8.1.24) Gecko/20100301 SeaMonkey/1.1.19 To: Quartz Cc: Willem Jan Withagen , fs@freebsd.org Subject: Re: This diskfailure should not panic a system, but just disconnect disk from ZFS References: <5585767B.4000206@digiware.nl> <5587236A.6020404@sneakertech.com> In-reply-to: <5587236A.6020404@sneakertech.com> X-BeenThere: freebsd-fs@freebsd.org X-Mailman-Version: 2.1.20 Precedence: list List-Id: Filesystems List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 22 Jun 2015 01:49:50 -0000 Quartz wrote: > Also: > >> And thus I'd would have expected that ZFS would disconnect /dev/da0 and >> then switch to DEGRADED state and continue, letting the operator fix the >> broken disk. > >> Next question to answer is why this WD RED on: > >> got hung, and nothing for this shows in SMART.... > > You have a raidz2, which means THREE disks need to go down before the > pool is unwritable. The problem is most likely your controller or > power supply, not your disks. > Never make such assumptions... I have worked in a professional environment where 9 of 12 disks failed within 24 hours of each other.... They were all supposed to be from different batches but due to an error they came from the same batch and the environment was so tightly controlled and the work-load was so similar that MTBF was almost identical on all 11 disks in the array... the only disk that lasted more than 2 weeks over the failure was the hotspare...! -- Michelle Sullivan http://www.mhix.org/