From owner-freebsd-fs@FreeBSD.ORG Wed Oct 1 13:06:43 2014 Return-Path: Delivered-To: freebsd-fs@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [8.8.178.115]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by hub.freebsd.org (Postfix) with ESMTPS id 17D11E3D for ; Wed, 1 Oct 2014 13:06:43 +0000 (UTC) Received: from mail-wi0-x233.google.com (mail-wi0-x233.google.com [IPv6:2a00:1450:400c:c05::233]) (using TLSv1 with cipher ECDHE-RSA-RC4-SHA (128/128 bits)) (Client CN "smtp.gmail.com", Issuer "Google Internet Authority G2" (verified OK)) by mx1.freebsd.org (Postfix) with ESMTPS id A3DE4216 for ; Wed, 1 Oct 2014 13:06:42 +0000 (UTC) Received: by mail-wi0-f179.google.com with SMTP id d1so531816wiv.6 for ; Wed, 01 Oct 2014 06:06:40 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :cc:content-type; bh=75LBMN1Hv/mum65TDy0CUAVnKyklV1wEhb9Jh2LNcIo=; b=hGd0MITVxgwkIRz4tGODGWYWJ/fc/SqDWRipmP9zLCrEINScS8ErnDT4FeoxFnswTo qUrZMqNXOrQuhMM/FVgFX6lboFgXfe3I/fz1N/pDbTfBy/Qj+qwsU5L45N+FFMxn9b1D BA1Pc5bxGLv2p5kdNd6fVLAaiysWSqZcKlKsRzw1CzfU6HqfsyW8J+6jE2goBbg9g+5Y gD/TEuGRPdXNr4cqCgEarRUM7eAwACnNB/SHlYcjuIP8gaQu56a66gStV87YxiP2TuNV 6XYyRRKWfzpEmONrTUDUIeFtwhF59HluY3kqc0ICZQET5+g53s+7iHWJCBo+K2/705UL zbLA== MIME-Version: 1.0 X-Received: by 10.194.76.97 with SMTP id j1mr60472076wjw.40.1412168800795; Wed, 01 Oct 2014 06:06:40 -0700 (PDT) Received: by 10.27.137.130 with HTTP; Wed, 1 Oct 2014 06:06:40 -0700 (PDT) In-Reply-To: <542BF853.3040604@internetx.com> References: <542BC135.1070906@Skynet.be> <542BDDB3.8080805@internetx.com> <542BF853.3040604@internetx.com> Date: Wed, 1 Oct 2014 16:06:40 +0300 Message-ID: Subject: Re: HAST with broken HDD From: George Kontostanos To: jg@internetx.com Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable X-Content-Filtered-By: Mailman/MimeDel 2.1.18-1 Cc: freebsd-fs@freebsd.org, JF-Bogaerts X-BeenThere: freebsd-fs@freebsd.org X-Mailman-Version: 2.1.18-1 Precedence: list List-Id: Filesystems List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 01 Oct 2014 13:06:43 -0000 On Wed, Oct 1, 2014 at 3:49 PM, InterNetX - Juergen Gotteswinter < jg@internetx.com> wrote: > Am 01.10.2014 um 14:28 schrieb George Kontostanos: > > > > On Wed, Oct 1, 2014 at 1:55 PM, InterNetX - Juergen Gotteswinter > > > wrote: > > > > Am 01.10.2014 um 10:54 schrieb JF-Bogaerts: > > > Hello, > > > I'm preparing a HA NAS solution using HAST. > > > I'm wondering what will happen if one of disks of the primary > node will > > > fail or become erratic. > > > > > > Thx, > > > Jean-Fran=C3=A7ois Bogaerts > > > > nothing. if you are using zfs on top of hast zfs wont even take > notice > > about the disk failure. > > > > as long as the write operation was sucessfull on one of the 2 nodes= , > > hast doesnt notify the ontop layers about io errors. > > > > interesting concept, took me some time to deal with this. > > > > > > Are you saying that the pool will appear to be optimal even with a bad > > drive? > > > > > > https://forums.freebsd.org/viewtopic.php?&t=3D24786 > It appears that this is actually the case. And it is very disturbing, meaning that a drive failure goes unnoticed. In my case I completely removed the second disk on the primary node and a zpool status showed absolutely no problem. Scrubbing the pool began resilvering which indicates that there is actually something wrong! pool: tank state: ONLINE status: One or more devices has experienced an error resulting in data corruption. Applications may be affected. action: Restore the file in question if possible. Otherwise restore the entire pool from backup. see: http://illumos.org/msg/ZFS-8000-8A scan: scrub repaired 16K in 0h2m with 7 errors on Wed Oct 1 16:00:47 201= 4 config: NAME STATE READ WRITE CKSUM tank ONLINE 0 0 7 mirror-0 ONLINE 0 0 40 hast/disk1 ONLINE 0 0 40 hast/disk2 ONLINE 0 0 40 Unfortunately, in this case there was data loss and hastctl status does not report the missing disk! Name Status Role Components disk1 complete primary /dev/ada1 hast2 disk2 complete primary /dev/ada2 hast2 --=20 George Kontostanos --- http://www.aisecure.net