Date:      Mon, 16 Nov 2009 23:16:27 +0100
From:      Roland Smith <rsmith@xs4all.nl>
To:        Bruce Cran <bruce@cran.org.uk>
Cc:        freebsd-questions@freebsd.org, "Ronald F. Guilmette" <rfg@tristatelogic.com>
Subject:   Re: Bad Blocks... Should I RMA?
Message-ID:  <20091116221627.GA2725@slackbox.xs4all.nl>
In-Reply-To: <20091116214331.00007810@unknown>
References:  <42307.1258330015@tristatelogic.com> <20091116182358.GA95918@slackbox.xs4all.nl> <20091116214331.00007810@unknown>

On Mon, Nov 16, 2009 at 09:43:31PM +0000, Bruce Cran wrote:
> On Mon, 16 Nov 2009 19:23:58 +0100
> Roland Smith <rsmith@xs4all.nl> wrote:
>
> > Install the smartmontools port, and check the drive with
> > 'smartctl -a /dev/ad4'. If you see a non-zero Reallocated_Sector_Ct,
> > RMA it immediately, as it is about to fail. If you see other errors
> > reported, RMA it.
> >
> > (S)ATA disks have spare sectors available. If a sector fails, it is
> > replaced by one of the spares by the firmware. If you see a non-zero
> > Reallocated_Sector_Ct, it means that the drive has run out of spares.
> > This is bad news.
>
> Surely it's the other way around - if you see a value of zero in the
> "value" column the drive has run out of spare sectors and it's time to
> RMA the drive?

I was talking about the RAW_VALUE column. There seem to be some differences
in interpretation between vendors as to what the VALUE column means. Most of
the advice I've seen over the years says to look at the RAW_VALUE.

See http://en.wikipedia.org/wiki/S.M.A.R.T. as well.

> From what I've seen the 'raw' column appears to count
> the number of sectors the drive has remapped using the spares buffer.
> If it gets into the hundreds it's probably time to think about RMA'ing
> the drive

Yes, the raw value is the number of sectors allocated from the spares. I
originally thought it was the number of reallocations _beyond_ the
spares. That's a misunderstanding on my part.

Nevertheless, this attribute (along with several others) is marked on the
Wikipedia page for S.M.A.R.T. as a "Potential indicator of imminent
electromechanical failure". You can find the same attributes marked as
critical when perusing mailing list archives.
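
As a rough illustration of watching the RAW_VALUE column, here is a minimal
Python sketch that scans the attribute table printed by smartctl and reports
watched attributes with a non-zero raw value. The function names and the
WATCHED set are assumptions made for this example; only Reallocated_Sector_Ct
is named in this thread, and the sketch uses 'smartctl -A' (attributes only)
instead of 'smartctl -a'.

```python
import subprocess

# Illustrative selection (an assumption); only Reallocated_Sector_Ct is
# named in this thread. Adjust to the attributes your drive reports.
WATCHED = {
    "Reallocated_Sector_Ct",
    "Current_Pending_Sector",
    "Offline_Uncorrectable",
}


def nonzero_raw_values(smartctl_output, watched=WATCHED):
    """Return {name: raw_value} for watched attributes with RAW_VALUE > 0."""
    found = {}
    for line in smartctl_output.splitlines():
        fields = line.split()
        # Attribute rows start with a numeric ID and have at least 10 columns;
        # the tenth column is RAW_VALUE.
        if len(fields) < 10 or not fields[0].isdigit():
            continue
        name, raw = fields[1], fields[9]
        if name in watched and raw.isdigit() and int(raw) > 0:
            found[name] = int(raw)
    return found


def check_drive(device="/dev/ad4"):
    """Run 'smartctl -A' on the device; needs smartmontools and privileges."""
    result = subprocess.run(
        ["smartctl", "-A", device], capture_output=True, text=True
    )
    return nonzero_raw_values(result.stdout)
```

An empty dictionary from check_drive() means none of the watched raw values
has moved off zero. The parser only looks at the raw column, since the
normalized VALUE column is interpreted differently between vendors.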

For me, my data is worth much more than the hard disk it is on. Some of it is
literally irreplaceable. So my policy is to start looking for a replacement
hard disk as soon as the RAW_VALUE of any of these critical indicators starts
going up from zero, and to store any data on at least two hard disks, whether
in a mirror or in a cron+rsync setup.
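
For the cron+rsync variant, a minimal Python sketch of the copy step might
look like the following. The function names and any paths passed to them are
hypothetical; the script would be started periodically from a crontab entry.

```python
import subprocess


def rsync_command(source, destination):
    """Build an rsync invocation that mirrors source into destination."""
    # -a preserves permissions, timestamps and symlinks; --delete removes
    # files from the copy that no longer exist in the source.
    # The trailing slash makes rsync copy the directory's contents rather
    # than the directory itself.
    return ["rsync", "-a", "--delete", source.rstrip("/") + "/", destination]


def mirror(source, destination):
    """Run the copy; raises CalledProcessError if rsync reports a failure."""
    subprocess.run(rsync_command(source, destination), check=True)
```

Note that --delete also propagates deletions to the second disk, so this
gives a mirror of the current state, not a history of older versions.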

Roland
-- 
R.F.Smith                                   http://www.xs4all.nl/~rsmith/
[plain text _non-HTML_ PGP/GnuPG encrypted/signed email much appreciated]
pgp: 1A2B 477F 9970 BA3C 2914  B7CE 1277 EFB0 C321 A725 (KeyID: C321A725)



