Skip site navigation (1)Skip section navigation (2)
Date:      Thu, 10 Oct 2013 21:09:37 +0300
From:      Mikolaj Golub <trociny@FreeBSD.org>
To:        Steven Hartland <killing@multiplay.co.uk>
Cc:        freebsd-stable@freebsd.org, Pete French <petefrench@ingresso.co.uk>
Subject:   Re: hast and zfs trim possibly causing some problems in 9.2
Message-ID:  <20131010180936.GP44942@gmail.com>
In-Reply-To: <DE6E10A421A8451699610A30D9344257@multiplay.co.uk>
References:  <E1VTuhN-0006UM-73@dilbert.ingresso.co.uk> <DE6E10A421A8451699610A30D9344257@multiplay.co.uk>

next in thread | previous in thread | raw e-mail | index | archive | help
On Wed, Oct 09, 2013 at 03:47:29PM +0100, Steven Hartland wrote:

> ZFS will try to send DELETE requests to the underlying storage to
> support TRIM. If that fails then it will disable TRIM support for
> that vdev.
> 
> My guess would be you're just seeing hast being a bit verbose
> when these initial batch failures happen.

If the device on the secondary node does not supports DELETE, but the
device on the primary does, HAST will report to ZFS that DELETE
succeeded (although it failed on the secondary), and ZFS will not
disable TRIM. Pete, isn't this your case?

> From: "Pete French" <petefrench@ingresso.co.uk>
> 
> >I just had a machine fall over on my for the first time in ages - one
> > of a pair of machine we have running hast with zfs on top. I havent
> > got any concrete evidence of what made it die as yet, but I
> > did notice the logifles filling up with thoursands of lines like this
> > just prior to the crash:
> > 
> > serpentine-active hastd[1522]: [serp1] (primary) Remote request failed (Operation not supported): DELETE(26847744000, 1536).
> > 
> > so I am guessing taht is ZFS trying to send a trim command to hast, and hast
> > does not support it. Have disabled zfs trim now, but thought it was
> > worth mentioning - I would have not expected zfs to be trying to issue
> > a trim command to an underlying device which doesnt support it. These
> > machines were rock solid under 8, and the only chnage I can see with 9 is
> > the trim support being added.

Another important change that comes to mind is the default replication
mode, changed from fullsync to memsync. Do you have the replication
mode explicitly set in your config?

-- 
Mikolaj Golub



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20131010180936.GP44942>