From: Thomas Backman <serenity@exscape.org>
To: FreeBSD Current
Cc: freebsd-fs@freebsd.org
Date: Fri, 12 Jun 2009 19:32:51 +0200
Subject: ZFS: Silent/hidden errors, nothing logged anywhere

OK, so I filed a PR late May (kern/135050):
http://www.freebsd.org/cgi/query-pr.cgi?pr=135050

I don't know if this is a "feature" or a bug, but it really should be
considered the latter. The data could be repaired in the background without
the user ever knowing - until the disk dies completely. I'd prefer to have
warning signs (i.e. checksum errors) so that I can buy a replacement drive
*before* that. Not only does this mean that errors can go unnoticed, but
also that it's impossible to figure out which disk is broken once ZFS has
*temporarily* repaired the broken data! THAT is REALLY bad!

Is this something that we can expect to see changed before 8.0-RELEASE?

BTW, note that the md5sums always check out (good!), and that "x MB repaired"
is only ever reported after a scrub, never when silent damage is repaired on
the fly (bad!). Scrubbing may also be a hard task for a dying disk - I
haven't tried it, but I'd guess so.

Regards,
Thomas

PS. I'm not subscribed to fs@, so please CC me if you read this message
over there.
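In the meantime, the only way I can see to get any warning at all is to poll
'zpool status -x' from cron, since it prints "all pools are healthy" when
nothing is wrong. A rough, untested sketch - the script path and schedule are
just examples, not anything that exists in the tree:

#!/bin/sh
# Rough sketch: mail root if any pool reports problems.
# Example crontab entry (path is made up):
#   */15 * * * * root /usr/local/sbin/zpool-check.sh
STATUS=$(zpool status -x)
if [ "$STATUS" != "all pools are healthy" ]; then
    echo "$STATUS" | mail -s "ZFS errors on $(hostname)" root
fi

Full test session follows.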
[root@clone ~]# uname -a
FreeBSD clone.exscape.org 8.0-CURRENT FreeBSD 8.0-CURRENT #0 r194059M: Fri Jun 12 18:25:05 CEST 2009     root@clone.exscape.org:/usr/obj/usr/src/sys/DTRACE  amd64
[root@clone ~]# sysctl kern.geom.debugflags=0x10   ### To allow overwriting of the disk
kern.geom.debugflags: 0 -> 16
[root@clone ~]# zpool create test raidz da1 da2 da3
[root@clone ~]# dd if=/dev/random of=/test/testfile bs=1000k
dd: /test/testfile: No space left on device
188+0 records in
187+1 records out
192413696 bytes transferred in 105.004322 secs (1832436 bytes/sec)
[root@clone ~]# dd if=/dev/random of=/dev/da3 bs=1000k count=10 seek=80
10+0 records in
10+0 records out
10240000 bytes transferred in 0.838391 secs (12213871 bytes/sec)
[root@clone ~]# cat /test/testfile > /dev/null
[root@clone ~]# zpool status -xv
  pool: test
 state: ONLINE
status: One or more devices has experienced an unrecoverable error.  An
        attempt was made to correct the error.  Applications are unaffected.
action: Determine if the device needs to be replaced, and clear the errors
        using 'zpool clear' or replace the device with 'zpool replace'.
   see: http://www.sun.com/msg/ZFS-8000-9P
 scrub: none requested
config:

        NAME        STATE     READ WRITE CKSUM
        test        ONLINE       0     0     0
          raidz1    ONLINE       0     0     0
            da1     ONLINE       0     0     0
            da2     ONLINE       0     0     0
            da3     ONLINE       0     0    92

errors: No known data errors
[root@clone ~]# reboot

--- immediately after reboot ---

[root@clone ~]# zpool status -xv
  pool: test
 state: ONLINE
status: One or more devices has experienced an unrecoverable error.  An
        attempt was made to correct the error.  Applications are unaffected.
action: Determine if the device needs to be replaced, and clear the errors
        using 'zpool clear' or replace the device with 'zpool replace'.
   see: http://www.sun.com/msg/ZFS-8000-9P
 scrub: none requested
config:

        NAME        STATE     READ WRITE CKSUM
        test        ONLINE       0     0     0
          raidz1    ONLINE       0     0     0
            da1     ONLINE       0     0     0
            da2     ONLINE       0     0     0
            da3     ONLINE       0     0     1

errors: No known data errors
[root@clone ~]# zpool scrub test
(...)
[root@clone ~]# zpool status -xv
  pool: test
 state: ONLINE
status: One or more devices has experienced an unrecoverable error.  An
        attempt was made to correct the error.  Applications are unaffected.
action: Determine if the device needs to be replaced, and clear the errors
        using 'zpool clear' or replace the device with 'zpool replace'.
   see: http://www.sun.com/msg/ZFS-8000-9P
 scrub: scrub completed after 0h0m with 0 errors on Fri Jun 12 19:11:36 2009
config:

        NAME        STATE     READ WRITE CKSUM
        test        ONLINE       0     0     0
          raidz1    ONLINE       0     0     0
            da1     ONLINE       0     0     0
            da2     ONLINE       0     0     0
            da3     ONLINE       0     0    88  2.72M repaired

errors: No known data errors
[root@clone ~]# reboot

--- immediately after reboot, again ---

[root@clone ~]# zpool status -xv
all pools are healthy
[root@clone ~]# zpool status -v test
  pool: test
 state: ONLINE
 scrub: none requested
config:

        NAME        STATE     READ WRITE CKSUM
        test        ONLINE       0     0     0
          raidz1    ONLINE       0     0     0
            da1     ONLINE       0     0     0
            da2     ONLINE       0     0     0
            da3     ONLINE       0     0     0

errors: No known data errors
[root@clone ~]#

----------------- even more testing, no scrub this time -----------------

[root@clone ~]# sysctl kern.geom.debugflags=0x10
kern.geom.debugflags: 0 -> 16
[root@clone ~]# md5 /test/testfile && dd if=/dev/random of=/dev/da2 bs=1000k count=10 seek=40 ; md5 /test/testfile
MD5 (/test/testfile) = 510479f16592bf66e7ba63c0a4dda0b6
10+0 records in
10+0 records out
10240000 bytes transferred in 0.901645 secs (11357020 bytes/sec)
MD5 (/test/testfile) = 510479f16592bf66e7ba63c0a4dda0b6
[root@clone ~]# zpool status -xv
  pool: test
 state: ONLINE
status: One or more devices has experienced an unrecoverable error.  An
        attempt was made to correct the error.  Applications are unaffected.
action: Determine if the device needs to be replaced, and clear the errors
        using 'zpool clear' or replace the device with 'zpool replace'.
   see: http://www.sun.com/msg/ZFS-8000-9P
 scrub: none requested
config:

        NAME        STATE     READ WRITE CKSUM
        test        ONLINE       0     0     0
          raidz1    ONLINE       0     0     0
            da1     ONLINE       0     0     0
            da2     ONLINE       0     0   104
            da3     ONLINE       0     0     0

errors: No known data errors
[root@clone ~]# reboot

--- immediately after reboot, yet again ---

[root@clone ~]# md5 /test/testfile
MD5 (/test/testfile) = 510479f16592bf66e7ba63c0a4dda0b6
[root@clone ~]# zpool status -xv
  pool: test
 state: ONLINE
status: One or more devices has experienced an unrecoverable error.  An
        attempt was made to correct the error.  Applications are unaffected.
action: Determine if the device needs to be replaced, and clear the errors
        using 'zpool clear' or replace the device with 'zpool replace'.
   see: http://www.sun.com/msg/ZFS-8000-9P
 scrub: none requested
config:

        NAME        STATE     READ WRITE CKSUM
        test        ONLINE       0     0     0
          raidz1    ONLINE       0     0     0
            da1     ONLINE       0     0     0
            da2     ONLINE       0     0     3
            da3     ONLINE       0     0     0

errors: No known data errors
[root@clone ~]# reboot

--- immediately after reboot, yet *again* ---

[root@clone ~]# md5 /test/testfile
MD5 (/test/testfile) = 510479f16592bf66e7ba63c0a4dda0b6
[root@clone ~]# zpool status -xv
all pools are healthy
[root@clone ~]# zpool status -v test
  pool: test
 state: ONLINE
 scrub: none requested
config:

        NAME        STATE     READ WRITE CKSUM
        test        ONLINE       0     0     0
          raidz1    ONLINE       0     0     0
            da1     ONLINE       0     0     0
            da2     ONLINE       0     0     0
            da3     ONLINE       0     0     0

errors: No known data errors
[root@clone ~]# zpool history -il test
History for 'test':
2009-06-12.19:03:43 zpool create test raidz da1 da2 da3 [user root on clone.exscape.org:global]
2009-06-12.19:10:42 [internal pool scrub txg:160] func=1 mintxg=0 maxtxg=160 [user root on clone.exscape.org]
2009-06-12.19:10:44 zpool scrub test [user root on clone.exscape.org:global]
2009-06-12.19:11:36 [internal pool scrub done txg:162] complete=1 [user root on clone.exscape.org]
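For anyone who wants to try this without retyping the session above, the test
boils down to roughly the following. This is only a sketch of the steps shown
earlier, not a polished script, and it is destructive: it assumes da1, da2 and
da3 are scratch disks and overwrites them.

#!/bin/sh
# Condensed reproduction of the test above. DESTRUCTIVE: overwrites da1-da3.
sysctl kern.geom.debugflags=0x10                          # allow writing to the raw disks
zpool create test raidz da1 da2 da3
dd if=/dev/random of=/test/testfile bs=1000k              # fill the pool (stops at ENOSPC)
dd if=/dev/random of=/dev/da3 bs=1000k count=10 seek=80   # corrupt one disk behind ZFS's back
cat /test/testfile > /dev/null                            # force ZFS to read (and silently repair) the damage
zpool status -xv                                          # CKSUM errors show up here...
# ...then reboot, run 'zpool status -xv' again, and watch the counters disappear.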