Date:      Thu, 5 Oct 2006 11:30:27 +0300
From:      Kostik Belousov <kostikbel@gmail.com>
To:        Vivek Khera <vivek@khera.org>
Cc:        stable@freebsd.org
Subject:   Re: ffs snapshot lockup
Message-ID:  <20061005083027.GK89654@deviant.kiev.zoral.com.ua>
In-Reply-To: <40CE3CF0-49D2-4335-A0B8-34B5251E9E19@khera.org>
References:  <917B087C-5E13-4D7F-94FA-95CB0E5C1884@khera.org> <20060922190328.GA64849@xor.obsecurity.org> <555B84D2-520F-44D6-84D6-CF9CE7EE47C7@khera.org> <20060922203654.GA65693@xor.obsecurity.org> <847DD3A5-D5DD-4D3E-B755-64B13D1DA506@khera.org> <20061003084315.GA89654@deviant.kiev.zoral.com.ua> <40CE3CF0-49D2-4335-A0B8-34B5251E9E19@khera.org>

On Wed, Oct 04, 2006 at 05:16:53PM -0400, Vivek Khera wrote:
>
> On Oct 3, 2006, at 4:43 AM, Kostik Belousov wrote:
>
> >>Details are posted at http://vivek.khera.org/scratch/crashlogs/
> >>
> >>I have the crashdumps available to a kernel hacker upon request (i'd
> >>rather not make them generally available to the public...)
> >>
> >It seems that you have snapshotted fs exported by nfsd ? At least,
> >18a is definitely the case. I have the patch (for current) that shall
> >fix the issue.
> >In fact, you need two patches:
>
> As per advice of Kris Kenneway, I turned off the software watchdog to
> rule out that as my problem. Then I ran a level 3 dump. Dump of root
> fs went fine, then it proceeded to do /usr. After a few minutes it
> locked up. Typescript 20 at the above URL shows the debugging info
> from the break into debugger of the locked up system. Since /usr was
> locked, nobody could log in at all.
>
> The network load was minimal at the time.  I had everyone log out and
> close mail etc.
>

What were the symptoms of the locked system? Could you log in on the console,
or do anything at a shell prompt on the console?

Also, did the system respond to pings? Fs-related deadlocks (as well as
stalled disk io) usually do not prevent the lowest levels of the
isr/network stack from working.

Again, I do not see an fs deadlock per se in the supplied typescript. Dump is
doing disk io, and nfsd seems to be trying to serve some request. Sshd looks
to be ready to accept connections.

If the console is available but ping responses do not arrive, this is
definitely a network card problem.
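
The two checks above (console and ping) can be read as a rough decision
table. What follows is only a sketch of that reasoning: the function name
`triage` and the result strings are made up for illustration, and the
"ping answers but console does not" branch is an inference from the remark
that fs deadlocks usually leave the low-level network stack working. Cases
the message does not discuss are marked as not covered.

```python
def triage(console_ok: bool, ping_ok: bool) -> str:
    """Map two observations from a hung machine to a likely cause.

    Hypothetical helper: it encodes only the rules of thumb stated in
    this message, nothing more.
    """
    if console_ok and not ping_ok:
        # Console works but pings go unanswered: points at the network card.
        return "network card problem"
    if ping_ok and not console_ok:
        # The low-level network stack still answers while logins hang:
        # consistent with an fs deadlock or stalled disk io.
        return "possible fs deadlock or stalled disk io"
    # Both work, or neither does: the message gives no rule for these cases.
    return "not covered; need more debugging info"


if __name__ == "__main__":
    print(triage(console_ok=True, ping_ok=False))
```

Neither result is a diagnosis by itself; the typescript from the
debugger is still needed to confirm where the threads are blocked.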



