Date: Wed, 11 Jun 2008 21:40:57 +0200 (CEST)
From: Wojciech Puchar
To: freebsd-questions@freebsd.org
Subject: Re: FreeBSD + ZFS on a production server?
In-Reply-To: <200806111929.m5BJTAPY070813@lurza.secnetix.de>
Message-ID: <20080611213343.D31099@wojtek.tensor.gdynia.pl>
References: <200806111929.m5BJTAPY070813@lurza.secnetix.de>

> A broken processor usually results in random crashes, not
> silent data corruption.

In my practice it results in both. With broken companion chips (chipset), silent data corruption is common, while crashes happen only in specific cases. That is my experience.

> > or even calculate checksum right of wrong data generated by badly
> > operating programs.
>
> What do you mean, wrong data generated by programs?

I mean wrong data generated by a program because of a hardware problem.

> You usually notice it when it's too late and the last
> good backup media was already recycled.

It's not that bad, but of course I make backups.

> > i think all your cases wasn't disk, but general hardware problems.
>
> In my case it was a disk with media surface errors, and
> the disk failed to report the error properly to the OS.
> Instead it just returned bad data.

Then I am just happy never to have had that, while normal disk failures are quite common.

> > ZFS may help detect it, or it may not. if it helped for you.
>
> Please stop spreading FUD. There is no "may or may not".
> If a disk returns bad data, ZFS _will_ detect it.

Please read more carefully. I didn't say that. I only said that "disk returning bad data" is a very rare case; lots of other, more frequent hardware problems will not be detected.

If you want to spend lots of CPU power and disk bandwidth calculating checksums on each read/write, then OK. If you think you are secured this way, then OK. I am just saying it doesn't give much protection against bad hardware and is not worth the expense.

I probably shouldn't have raised that point, as checksumming can be turned off in ZFS.
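
The distinction both sides are arguing over can be shown with a minimal sketch. It is only an illustration, not how ZFS is implemented: `hashlib.sha256` stands in for whatever checksum the filesystem uses, and `store`, `verify`, and `flip_bit` are hypothetical helpers made up for this example. A checksum catches data that changes after it was checksummed (the disk returning bad data), but it cannot catch data that was already wrong when it was handed to the filesystem (bad RAM, chipset, or CPU upstream).

```python
import hashlib


def store(block):
    """Hypothetical write path: checksum whatever bytes it is handed."""
    return block, hashlib.sha256(block).digest()


def verify(block, checksum):
    """Hypothetical read path: recompute the checksum and compare."""
    return hashlib.sha256(block).digest() == checksum


def flip_bit(block, index=0):
    """Simulate a single-bit hardware error in one byte of the block."""
    damaged = bytearray(block)
    damaged[index] ^= 0x01
    return bytes(damaged)


good = b"account balance: 1000"

# Case 1: the data is fine when written, but the disk returns bad data.
stored, csum = store(good)
detected_on_read = not verify(flip_bit(stored), csum)

# Case 2: the data is corrupted BEFORE it reaches the write path.
# The checksum is computed over the wrong data, so it verifies cleanly.
stored_bad, csum_bad = store(flip_bit(good))
passes_despite_corruption = verify(stored_bad, csum_bad)

print("disk-side corruption detected:", detected_on_read)
print("upstream corruption passes verification:", passes_despite_corruption)
```

Which of the two cases is more common in practice is exactly the point in dispute in this thread; the sketch only shows that they behave differently.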