Skip site navigation (1)Skip section navigation (2)
Date:      Fri, 29 May 2009 08:05:51 GMT
From:      Thomas Backman <serenity@exscape.org>
To:        freebsd-gnats-submit@FreeBSD.org
Subject:   kern/135050: ZFS clears/hides disk errors on reboot
Message-ID:  <200905290805.n4T85pAK080704@www.freebsd.org>
Resent-Message-ID: <200905290810.n4T8A4Ul068635@freefall.freebsd.org>

next in thread | raw e-mail | index | archive | help

>Number:         135050
>Category:       kern
>Synopsis:       ZFS clears/hides disk errors on reboot
>Confidential:   no
>Severity:       serious
>Priority:       high
>Responsible:    freebsd-bugs
>State:          open
>Quarter:        
>Keywords:       
>Date-Required:
>Class:          sw-bug
>Submitter-Id:   current-users
>Arrival-Date:   Fri May 29 08:10:04 UTC 2009
>Closed-Date:
>Last-Modified:
>Originator:     Thomas Backman
>Release:        8.0-CURRENT, r192914
>Organization:
exscape
>Environment:
FreeBSD clone.exscape.org 8.0-CURRENT FreeBSD 8.0-CURRENT #4 r192914: Thu May 28 08:56:46 CEST 2009     root@clone.exscape.org:/usr/obj/usr/src/sys/DTRACE  amd64

>Description:
(Not sure if this is kern or bin, but I'll take a shot.)

When a disk is corrupted, "zpool status" hides the fact that there has ever been any corruption if the system is rebooted. In practice, this could lead to silent corruption (that is fixed by ZFS temporarily, while the disk is dying) without the using ever finding out. Quite bad. It should tell the user that there have been problems. zpool history -il shown nothing of interest either.
>How-To-Repeat:
(... create pool etc ...)
[root@clone ~]# dd if=/dev/random of=/dev/ad2 bs=1000k count=5 seek=30
[root@clone ~]# zpool scrub test

(... wait for a while ...)

[root@clone ~]# zpool status -v test
  pool: test
 state: ONLINE
status: One or more devices has experienced an unrecoverable error.  An
	attempt was made to correct the error.  Applications are unaffected.
action: Determine if the device needs to be replaced, and clear the errors
	using 'zpool clear' or replace the device with 'zpool replace'.
   see: http://www.sun.com/msg/ZFS-8000-9P
 scrub: scrub completed after 0h2m with 0 errors on Fri May 29 09:52:50 2009
config:

	NAME        STATE     READ WRITE CKSUM
	test        ONLINE       0     0     0
	  raidz1    ONLINE       0     0     0
	    ad1     ONLINE       0     0     0
	    ad2     ONLINE       0     0    79  4.94M repaired
	    ad3     ONLINE       0     0     0

errors: No known data errors

[root@clone ~]# reboot

[root@clone ~]# zpool status -xv
all pools are healthy
[root@clone ~]# zpool status test
  pool: test
 state: ONLINE
 scrub: none requested
config:

	NAME        STATE     READ WRITE CKSUM
	test        ONLINE       0     0     0
	  raidz1    ONLINE       0     0     0
	    ad1     ONLINE       0     0     0
	    ad2     ONLINE       0     0     0
	    ad3     ONLINE       0     0     0

errors: No known data errors
>Fix:


>Release-Note:
>Audit-Trail:
>Unformatted:



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?200905290805.n4T85pAK080704>