Skip site navigation (1)Skip section navigation (2)
Date:      Tue, 7 Jul 2009 17:13:36 -0700
From:      "Mahlon E. Smith" <mahlon@martini.nu>
To:        Freddie Cash <fjwcash@gmail.com>
Cc:        freebsd-stable@freebsd.org
Subject:   Re: ZFS: drive replacement performance
Message-ID:  <20090708001336.GA95670@martini.nu>
In-Reply-To: <b269bc570907071532ub95af78i6ad3a09e8c6887d7@mail.gmail.com>
References:  <20090707195614.GA24326@martini.nu> <b269bc570907071354r36015689ha362ba83413efc46@mail.gmail.com> <20090707222631.GA70750@martini.nu> <b269bc570907071532ub95af78i6ad3a09e8c6887d7@mail.gmail.com>

next in thread | previous in thread | raw e-mail | index | archive | help

--azLHFNyN32YCQGCU
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
Content-Transfer-Encoding: quoted-printable

On Tue, Jul 07, 2009, Freddie Cash wrote:
>=20
> I think (never tried) you can use "zpool scrub -s store" to stop the
> resilver.  If not, you should be able to re-do the replace command.

Hmm.  I think I may be stuck.

% zpool scrub -s store

% zpool status | grep scrub
   scrub: resilver in progress for 0h0m, 0.00% done, 745h41m to go

% zpool replace store 2025342973333799752 da8
  invalid vdev specification
  use '-f' to override the following errors:
  /dev/da8 is part of active pool 'store'
 =20
% zpool replace -f store 2025342973333799752 da8
  invalid vdev specification
  the following errors must be manually repaired:
  /dev/da8 is part of active pool 'store'

% zpool detach store da8
  cannot detach da8: no valid replicas

% zpool detach store 2025342973333799752
  cannot detach 2025342973333799752: no valid replicas



I also tried another export/import cycle, in the random hope that would
stop the active replace -- no dice.  *However*, on the import, now I see
this flooding my console (wasn't there previously, strangely):

Jul  7 16:50:15 disobedience root: ZFS: vdev I/O failure, zpool=3Dstore pat=
h=3D/dev/da2 offset=3D262144 size=3D8192 error=3D6
Jul  7 16:50:15 disobedience root: ZFS: vdev I/O failure, zpool=3Dstore pat=
h=3D/dev/da2 offset=3D499988824064 size=3D8192 error=3D6

I now have to wonder if that's really the active da2 it is complaining
about (the one claiming to be online with 0 errors) or the one I'm
trying to replace with da8.  The current da2 doesn't seem to be
having any additional problems, like the checksum mismatches or other
associated console errors I've come to expect, but of course the old one
is no longer attached to the machine.  In any event, I'd wager that
isn't something I normally want to see, and I may have something else
going on here.  (Bad controller, etc?)

Serves me right for naming a machine 'disobedience', I guess.  Next one
is getting named 'subservience.'

Going to halt and pull da8 under the assumption that will at least stop
the resilver, and try the detach again.  I'll holler back if I get stuff
going again, but this is looking more like a hardware problem.

Thanks again for the insight!

-Mahlon

--
Mahlon E. Smith =20
http://www.martini.nu/contact.html

--azLHFNyN32YCQGCU
Content-Type: application/pgp-signature
Content-Disposition: inline

-----BEGIN PGP SIGNATURE-----

iD8DBQFKU+Sw1bsjBDapbeMRAoN1AJ4hnaXAcsumQ4YPl6hgeS8j+b0+swCgq8O0
4X/YnS2iCHK8jd47S0D15SE=
=GIVG
-----END PGP SIGNATURE-----

--azLHFNyN32YCQGCU--



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20090708001336.GA95670>