Skip site navigation (1)Skip section navigation (2)
Date:      Wed, 19 Jul 2006 17:43:05 +0300
From:      Kostik Belousov <kostikbel@gmail.com>
To:        User Freebsd <freebsd@hub.org>
Cc:        freebsd-stable@freebsd.org, Robert Watson <rwatson@freebsd.org>
Subject:   Re: file system deadlock - the whole story?
Message-ID:  <20060719144305.GM1464@deviant.kiev.zoral.com.ua>
In-Reply-To: <20060719112208.Y1799@ganymede.hub.org>
References:  <20060705100403.Y80381@fledge.watson.org> <cone.1152136419.991036.72616.1000@zoraida.natserv.net> <20060705234514.I70011@fledge.watson.org> <20060715000351.U1799@ganymede.hub.org> <20060715035308.GJ32624@deviant.kiev.zoral.com.ua> <20060718074804.W1799@ganymede.hub.org> <20060719112424.GK1464@deviant.kiev.zoral.com.ua> <20060719082627.H1799@ganymede.hub.org> <20060719151327.H5132@fledge.watson.org> <20060719112208.Y1799@ganymede.hub.org>

next in thread | previous in thread | raw e-mail | index | archive | help

--j3zO+32zXj6UcJCE
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
Content-Transfer-Encoding: quoted-printable

On Wed, Jul 19, 2006 at 11:23:21AM -0300, User Freebsd wrote:
> On Wed, 19 Jul 2006, Robert Watson wrote:
>=20
> >
> >On Wed, 19 Jul 2006, User Freebsd wrote:
> >
> >>Also note that under FreeBSD 4.x, all three of these machines were pret=
ty=20
> >>much my more solid machines, with even more vServers running on them th=
en=20
> >>I'm able to run with 6.x ... once I got rid of using unionfs, stability=
=20
> >>skyrocketed :(
> >>
> >>Hrmmmm ... but, your 'controller driver' comment ... that is one common=
=20
> >>thing amongst all three servers ... they are all running the iir driver=
=20
> >>... not sure the *exact* controller, but pluto (older Dual-PIII) shows =
it=20
> >>as:
> >
> >Yes, this was going to be my next question -- if you're seeing wedges=20
> >under load and there's a common controller in use, maybe we're looking a=
t=20
> >a driver bug.  Bugs of those sort typically look a lot like what you=20
> >describe: an I/O is "lost" and so eveything that depends on the I/O wedg=
es=20
> >waiting for it, leading to a lot of processes hanging around waiting for=
=20
> >vnode locks, etc.
>=20
> 'k, but how do we debug *that*? :(  If it was one, I'd suspect hardware=
=20
> ... but *three*, and only acting up *after* upgrading to FreeBSD 6.x, and=
=20
> only acting up under load ...

Obvious step would be to replace controller by some different kind.

--j3zO+32zXj6UcJCE
Content-Type: application/pgp-signature
Content-Disposition: inline

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.4 (FreeBSD)

iD8DBQFEvkT5C3+MBN1Mb4gRAkNDAKCZORLzP9p4pyUCwPjj///jfwGC7ACg5UCP
bGGFe0/owbW1Z5J1eN26Gbs=
=2gY6
-----END PGP SIGNATURE-----

--j3zO+32zXj6UcJCE--



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20060719144305.GM1464>