Skip site navigation (1)Skip section navigation (2)
Date:      Fri, 10 Jun 2011 13:11:02 +0100
From:      Howard Jones <howie@thingy.com>
To:        freebsd-stable@freebsd.org
Subject:   ZFS on 8.1 - various problems after a disk failure.
Message-ID:  <4DF209D6.4000107@thingy.com>

next in thread | raw e-mail | index | archive | help
I have a FreeBSD 8.2 server at home with 4 2TB drives in it running ZFS
in a raidz. Some time ago, I had a disk fail. Initially it wasn't
totally obvious the disk had failed so I ran a 'zpool scrub' on the
pool, which threw up a lot of errors, and also produces a lot of sense
errors, making it obvious I had a dead disk.

I replaced the disk, then ran "zpool replace zjumbo ad4 ad4" to replace
the bad disk in-place, and start a resilver.

Now I have a few problems:
1) The old ad4 is still listed, even after several scrub/resilvers.
Shouldn't it go away?
2) Although I lost a whole directory with ~1TB of music, the space
allocated to that directory is still around according df.
3) I have another bunch of files that appear in directory listings, but
if I get "Illegal byte sequence" errors when trying to read them (with
anything - du, file, wc).

I have backups of most of the stuff on the pool (although it'd be nice
to recover the more recent data), but how do I get out of this situation
without nuking the site from orbit? (my current plan) Firstly, to get a
reliable representation of what's actually on the filesystem, and for
bonus points, getting back some of the data that should be intact (only
one disk in the set was actually bad, right?).

Here's my current zpool status. Thanks in advance for any pointers!

Howie

# zpool status
  pool: zjumbo
 state: DEGRADED
status: One or more devices has experienced an error resulting in data
        corruption.  Applications may be affected.
action: Restore the file in question if possible.  Otherwise restore the
        entire pool from backup.
   see: http://www.sun.com/msg/ZFS-8000-8A
 scrub: resilver completed after 10h57m with 15190 errors on Thu May 19
09:26:59 2011
config:

        NAME           STATE     READ WRITE CKSUM
        zjumbo         DEGRADED     0     0  199K
          raidz1       DEGRADED     0     0  792K
            replacing  DEGRADED     0     0     0
              ad4/old  UNAVAIL      0 16.1M     0  cannot open
              ad4      ONLINE       0     0     0  1.15T resilvered
            ad6        ONLINE       0     0     0  677M resilvered
            ad8        ONLINE       0     0     0  660M resilvered
            ad10       ONLINE       0     0     0  535M resilvered

errors: 15190 data errors, use '-v' for a list




Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?4DF209D6.4000107>