Skip site navigation (1)Skip section navigation (2)
Date:      Sat, 23 Jan 2010 01:31:45 -0800
From:      Artem Belevich <fbsdlist@src.cx>
To:        Rich <rincebrain@gmail.com>
Cc:        freebsd-fs <freebsd-fs@freebsd.org>
Subject:   Re: Errors on a file on a zpool: How to remove?
Message-ID:  <ed91d4a81001230131m29446dcm1f4b5706c23665bc@mail.gmail.com>
In-Reply-To: <5da0588e1001230014k1b8a32f8v42046497265429ed@mail.gmail.com>
References:  <5da0588e1001222223m773648am907267235bdcf882@mail.gmail.com> <ed91d4a81001230011t7aef2da8h3be13d2494c06550@mail.gmail.com> <5da0588e1001230014k1b8a32f8v42046497265429ed@mail.gmail.com>

next in thread | previous in thread | raw e-mail | index | archive | help
> errors: Permanent errors have been detected in the following files:
> =A0 =A0 =A0 =A0rigatoni/mirrors:<0x0>

This looks similar to what I had. My wild guess would be that some
metadata got corrupted.

If you really want to fix it, you may need to get close and personal
with zdb and  ZFS on-disk format spec here:
http://hub.opensolaris.org/bin/download/Community+Group+zfs/docs/ondiskform=
at0822.pdf

Following video and PDF may help a bit:
http://video.google.com/videoplay?docid=3D2325724487196148104#
http://www.osdevcon.org/2008/files/osdevcon2008-max.pdf

If all you want is to avoid nagging errors you can try going up the
tree until you can do something with directory. Then move it somewhere
where it would not get in the way, "chmod 000" it and perhaps even do
"setflags schg" on it to prevent anyone from descending into directory
with bad files.

--Artem


On Sat, Jan 23, 2010 at 12:14 AM, Rich <rincebrain@gmail.com> wrote:
> I already diagnosed the bad hardware - one of the two sticks of RAM
> had gone bad, and fails memtest in the other machine.
>
> =A0pool: rigatoni
> =A0state: ONLINE
> status: One or more devices has experienced an error resulting in data
> =A0 =A0 =A0 =A0corruption. =A0Applications may be affected.
> action: Restore the file in question if possible. =A0Otherwise restore th=
e
> =A0 =A0 =A0 =A0entire pool from backup.
> =A0 see: http://www.sun.com/msg/ZFS-8000-8A
> =A0scrub: scrub completed after 15h28m with 1 errors on Thu Jan 21 18:09:=
25 2010
> config:
>
> =A0 =A0 =A0 =A0NAME =A0 =A0 =A0 =A0STATE =A0 =A0 READ WRITE CKSUM
> =A0 =A0 =A0 =A0rigatoni =A0 =A0ONLINE =A0 =A0 =A0 0 =A0 =A0 0 =A0 =A0 1
> =A0 =A0 =A0 =A0 =A0da4 =A0 =A0 =A0 ONLINE =A0 =A0 =A0 0 =A0 =A0 0 =A0 =A0=
 2
> =A0 =A0 =A0 =A0 =A0da5 =A0 =A0 =A0 ONLINE =A0 =A0 =A0 0 =A0 =A0 0 =A0 =A0=
 2
> =A0 =A0 =A0 =A0 =A0da7 =A0 =A0 =A0 ONLINE =A0 =A0 =A0 0 =A0 =A0 0 =A0 =A0=
 0
> =A0 =A0 =A0 =A0 =A0da6 =A0 =A0 =A0 ONLINE =A0 =A0 =A0 0 =A0 =A0 0 =A0 =A0=
 0
> =A0 =A0 =A0 =A0 =A0da2 =A0 =A0 =A0 ONLINE =A0 =A0 =A0 0 =A0 =A0 0 =A0 =A0=
 2
>
> errors: Permanent errors have been detected in the following files:
>
> =A0 =A0 =A0 =A0rigatoni/mirrors:<0x0>
>
> Scrubbing repeatedly does nothing to remove the note about that error,
> and I'd rather like to avoid trying to recreate a 7TB pool.
>
> - Rich
>
> On Sat, Jan 23, 2010 at 3:11 AM, Artem Belevich <fbsdlist@src.cx> wrote:
>> The directory that those files are in may be corrupted. What does
>> zpool status -v show?
>>
>> You may want to scrub the pool if you haven't done so yet. That would
>> help to find all corrupted files.
>>
>> When plain files are corrupted, you should be able to remove them. You
>> may also try to set atime=3Doff on the filesystem to avoid filesystem
>> updates on reads.
>> Some time back when I had zpool corruption I've found no way to remove
>> corrupted directory that still had some files in it. In the end I had
>> to rebuild the pool.
>>
>> BTW, given that your pool did get corrupted, perhaps it might be a
>> good idea to start moving your data somewhere else rather than worry
>> about how to remove corrupted files. If corruption is due to bad
>> hardware, bad files would just keep popping up.
>>
>> --Artem
>>
>>
>>
>> On Fri, Jan 22, 2010 at 10:23 PM, Rich <rincebrain@gmail.com> wrote:
>>> Hey world,
>>> I've got a series of files in a non-redundant zpool which all report
>>> Input/Output Error on attempting to manipulate them in any way - stat,
>>> read, rm, anything.
>>>
>>> Whenever anything is attempted, the following style of thing is
>>> printed to /var/log/messages:
>>> Jan 23 01:22:34 manticore root: ZFS: checksum mismatch, zpool=3Drigaton=
i
>>> path=3D/dev/da4 offset=3D1231402180608 size=3D8192
>>> Jan 23 01:22:34 manticore root: ZFS: checksum mismatch, zpool=3Drigaton=
i
>>> path=3D/dev/da5 offset=3D446136819712 size=3D8192
>>> Jan 23 01:22:34 manticore root: ZFS: checksum mismatch, zpool=3Drigaton=
i
>>> path=3D/dev/da2 offset=3D320393101312 size=3D8192
>>> Jan 23 01:22:34 manticore root: ZFS: checksum mismatch, zpool=3Drigaton=
i
>>> path=3D/dev/da5 offset=3D446136819712 size=3D8192
>>> Jan 23 01:22:34 manticore root: ZFS: checksum mismatch, zpool=3Drigaton=
i
>>> path=3D/dev/da2 offset=3D320393101312 size=3D8192
>>> Jan 23 01:22:34 manticore root: ZFS: checksum mismatch, zpool=3Drigaton=
i
>>> path=3D/dev/da4 offset=3D1231402180608 size=3D8192
>>> Jan 23 01:22:35 manticore root: ZFS: zpool I/O failure, zpool=3Drigaton=
i error=3D86
>>>
>>> What can I do? I really would like to just purge all of these files
>>> from orbit, since I can recreate them, but I can't seem to delete
>>> them, and deleting the pool is a really inconvenient option, as I have
>>> other data on it.
>>>
>>> I'm running 8.0-RELEASE stock on amd64.
>>>
>>> Thanks!
>>>
>>> - Rich
>>> _______________________________________________
>>> freebsd-fs@freebsd.org mailing list
>>> http://lists.freebsd.org/mailman/listinfo/freebsd-fs
>>> To unsubscribe, send any mail to "freebsd-fs-unsubscribe@freebsd.org"
>>>
>>
>
>
>
> --
>
> Todo homem morre, mas nem todo homem vive. -- William Wallace
>



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?ed91d4a81001230131m29446dcm1f4b5706c23665bc>