From owner-freebsd-fs@FreeBSD.ORG  Mon Jun 22 01:49:49 2015
Return-Path: <owner-freebsd-fs@FreeBSD.ORG>
Delivered-To: freebsd-fs@nevdull.freebsd.org
Received: from mx1.freebsd.org (mx1.freebsd.org
 [IPv6:2001:1900:2254:206a::19:1])
 (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits))
 (No client certificate requested)
 by hub.freebsd.org (Postfix) with ESMTPS id DF46D22E
 for <freebsd-fs@nevdull.freebsd.org>; Mon, 22 Jun 2015 01:49:49 +0000 (UTC)
 (envelope-from michelle@sorbs.net)
Received: from hub.freebsd.org (hub.freebsd.org
 [IPv6:2001:1900:2254:206c::16:88])
 (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits))
 (Client CN "hub.freebsd.org", Issuer "hub.freebsd.org" (not verified))
 by mx1.freebsd.org (Postfix) with ESMTPS id C4D18A2F
 for <freebsd-fs@FreeBSD.ORG>; Mon, 22 Jun 2015 01:49:49 +0000 (UTC)
 (envelope-from michelle@sorbs.net)
Received: by hub.freebsd.org (Postfix)
 id BAAE122D; Mon, 22 Jun 2015 01:49:49 +0000 (UTC)
Delivered-To: fs@nevdull.freebsd.org
Received: from mx1.freebsd.org (mx1.freebsd.org [8.8.178.115])
 (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits))
 (No client certificate requested)
 by hub.freebsd.org (Postfix) with ESMTPS id B8E0B22C
 for <fs@nevdull.freebsd.org>; Mon, 22 Jun 2015 01:49:49 +0000 (UTC)
 (envelope-from michelle@sorbs.net)
Received: from hades.sorbs.net (hades.sorbs.net [67.231.146.201])
 by mx1.freebsd.org (Postfix) with ESMTP id A6D6DA2E
 for <fs@freebsd.org>; Mon, 22 Jun 2015 01:49:49 +0000 (UTC)
 (envelope-from michelle@sorbs.net)
MIME-version: 1.0
Content-transfer-encoding: 7BIT
Content-type: text/plain; CHARSET=US-ASCII
Received: from isux.com (firewall.isux.com [213.165.190.213])
 by hades.sorbs.net
 (Oracle Communications Messaging Server 7.0.5.29.0 64bit (built Jul 9 2013))
 with ESMTPSA id <0NQB00IYKPCDW900@hades.sorbs.net> for fs@freebsd.org; Sun,
 21 Jun 2015 18:55:27 -0700 (PDT)
Message-id: <558769B5.601@sorbs.net>
Date: Mon, 22 Jun 2015 03:49:41 +0200
From: Michelle Sullivan <michelle@sorbs.net>
User-Agent: Mozilla/5.0 (Macintosh; U; Intel Mac OS X; en-US; rv:1.8.1.24)
 Gecko/20100301 SeaMonkey/1.1.19
To: Quartz <quartz@sneakertech.com>
Cc: Willem Jan Withagen <wjw@digiware.nl>, fs@freebsd.org
Subject: Re: This diskfailure should not panic a system,
 but just disconnect disk from ZFS
References: <5585767B.4000206@digiware.nl> <5587236A.6020404@sneakertech.com>
In-reply-to: <5587236A.6020404@sneakertech.com>
X-BeenThere: freebsd-fs@freebsd.org
X-Mailman-Version: 2.1.20
Precedence: list
List-Id: Filesystems <freebsd-fs.freebsd.org>
List-Unsubscribe: <http://lists.freebsd.org/mailman/options/freebsd-fs>,
 <mailto:freebsd-fs-request@freebsd.org?subject=unsubscribe>
List-Archive: <http://lists.freebsd.org/pipermail/freebsd-fs/>
List-Post: <mailto:freebsd-fs@freebsd.org>
List-Help: <mailto:freebsd-fs-request@freebsd.org?subject=help>
List-Subscribe: <http://lists.freebsd.org/mailman/listinfo/freebsd-fs>,
 <mailto:freebsd-fs-request@freebsd.org?subject=subscribe>
X-List-Received-Date: Mon, 22 Jun 2015 01:49:50 -0000

Quartz wrote:
> Also:
>
>> And thus I would have expected that ZFS would disconnect /dev/da0 and
>> then switch to DEGRADED state and continue, letting the operator fix the
>> broken disk.
>
>> Next question to answer is why this WD RED on:
>
>> got hung, and nothing for this shows in SMART....
>
> You have a raidz2, which means THREE disks need to go down before the
> pool is unwritable. The problem is most likely your controller or
> power supply, not your disks.
>
Never make such assumptions...

I have worked in a professional environment where 9 of 12 disks failed
within 24 hours of each other. They were all supposed to be from
different batches, but due to an error they came from the same batch.
The environment was so tightly controlled and the workload so similar
that the actual time to failure was almost identical across all 11
disks in the array. The only disk that lasted more than 2 weeks beyond
the failures was the hot spare!

-- 
Michelle Sullivan
http://www.mhix.org/