Skip site navigation (1)Skip section navigation (2)
Date:      Tue, 27 Aug 2013 18:32:01 +0300
From:      Daniel Braniss <danny@cs.huji.ac.il>
To:        Konstantin Belousov <kostikbel@gmail.com>
Cc:        Rick Macklem <rmacklem@uoguelph.ca>, freebsd-stable <freebsd-stable@freebsd.org>
Subject:   Re: another? NFS deadlock on 9.2-PRERELEASE
Message-ID:  <E1VELFR-000J7V-Hv@kabab.cs.huji.ac.il>
In-Reply-To: <20130827140412.GQ4972@kib.kiev.ua>
References:  <1740270726.14074212.1377605362382.JavaMail.root@uoguelph.ca>  <E1VEJoc-000IG5-1b@kabab.cs.huji.ac.il> <20130827140412.GQ4972@kib.kiev.ua>

next in thread | previous in thread | raw e-mail | index | archive | help
> 
> --wdMRLhhF94AmkTAJ
> Content-Type: text/plain; charset=us-ascii
> Content-Disposition: inline
> Content-Transfer-Encoding: quoted-printable
> 
> On Tue, Aug 27, 2013 at 05:00:14PM +0300, Daniel Braniss wrote:
> > > Daniel Braniss wrote:
> > > > > Daniel Braniss wrote:
> > > > > > I upgraded our web server, and only after 3 hours it hung :-(
> > > > > > (as a side note, I have 2 other web servers, also running 9.2
> > > > > > doing
> > > > > > great :-)
> > > > > > go figure.
> > > > > >=20
> > > > > > anyways, in
> > > > > > 	ftp://ftp.cs.huji.ac.il/users/danny/freebsd/core.txt/0
> > > > > >=20
> > > > > > is the info after a forced panic.
> > > > > >=20
> > > > > Looks like the same hang to me. Several threads are sleeping on
> > > > > "pgrbwt"
> > > > > and lots are waiting for an NFS vnode lock.
> > > > >=20
> > > > > It should be fixed in RC3 (or revert r250907). If it still hangs
> > > > > with
> > > > > RC3 (or r250907 reverted), email again.
> > > > >=20
> > > > im following stable, hence it's till calling itself 9.2-PRERELEASE,
> > > > but
> > > > I did a sync this morning - local time, after rc3 was anounced.
> > > > but after 3.45 minutes is hung, data in
> > > > 	ftp://ftp.cs.huji.ac.il/users/danny/freebsd/core.txt/1
> > > >=20
> > > > I can't easely revert r250907, since i'm using mercuriall, but if
> > > > someone
> > > > can send me the pre r250907 files, i'll try.
> > > >=20
> > > r254947, which was committed to stable/9 a few hours ago is believed to
> > > fix the problem. Please update your stable/9 to post-r254947 and try it.
> > >=20
> > the current kernel has that fix (sys/kern/uipc_syscalls.c)
> > and if you check the core.txt/1 you will see no pgrbwt, only newnsf ...
> 
> There is almost no useful information in the core.txt/1.
> Provide the known data for the deadlock.

maybe the word deadlock is too strong, the host is diskless (one of many)
and so when NFS stops working/respondig, it hangs.
I will now make it dataless - the / will be from local disk, see if that
makes it easier to debug.







Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?E1VELFR-000J7V-Hv>