Date:      Wed, 28 Aug 2013 09:20:22 +0300
From:      Daniel Braniss <danny@cs.huji.ac.il>
To:        Rick Macklem <rmacklem@uoguelph.ca>
Cc:        freebsd-stable <freebsd-stable@freebsd.org>
Subject:   Re: another? NFS deadlock on 9.2-PRERELEASE
Message-ID:  <E1VEZ78-0002PT-If@kabab.cs.huji.ac.il>
In-Reply-To: <161178921.14385682.1377634187359.JavaMail.root@uoguelph.ca>
References:  <161178921.14385682.1377634187359.JavaMail.root@uoguelph.ca>

> Daniel Braniss wrote:
> > > Daniel Braniss wrote:
> > > > > Daniel Braniss wrote:
> > > > > > I upgraded our web server, and after only 3 hours it hung :-(
> > > > > > (as a side note, I have 2 other web servers, also running 9.2
> > > > > > doing
> > > > > > great :-)
> > > > > > go figure.
> > > > > > 
> > > > > > anyways, in
> > > > > > 	ftp://ftp.cs.huji.ac.il/users/danny/freebsd/core.txt/0
> > > > > > 
> > > > > > is the info after a forced panic.
> > > > > > 
> > > > > Looks like the same hang to me. Several threads are sleeping on
> > > > > "pgrbwt"
> > > > > and lots are waiting for an NFS vnode lock.
> > > > > 
> > > > > It should be fixed in RC3 (or revert r250907). If it still
> > > > > hangs
> > > > > with
> > > > > RC3 (or r250907 reverted), email again.
> > > > > 
> > > > I'm following stable, hence it's still calling itself
> > > > 9.2-PRERELEASE,
> > > > but
> > > > I did a sync this morning - local time, after RC3 was announced.
> > > > But after 3.45 minutes it hung; the data is in
> > > > 	ftp://ftp.cs.huji.ac.il/users/danny/freebsd/core.txt/1
> > > > 
> > > > I can't easily revert r250907, since I'm using Mercurial, but if
> > > > someone
> > > > can send me the pre-r250907 files, I'll try.
> > > > 
> The pre-r250907 version of uipc_syscalls is at:
>  http://people.freebsd.org/~rmacklem/uipc_syscalls.c
> in case you want to try it.
> 
Thanks, I think I have a pre-r250907 kernel.
In the meantime I did 2 things:
	1- made / (root) local, as opposed to NFS-mounted
	2- set up a watchdog to reboot in case of a hang

The host has been up for more than 14 hours (I doubt it's because of 2 :-)

Let's see how things develop.

thanks,
	danny

> rick
> 
> > > r254947, which was committed to stable/9 a few hours ago is
> > > believed to
> > > fix the problem. Please update your stable/9 to post-r254947 and
> > > try it.
> > > 
> > the current kernel has that fix (sys/kern/uipc_syscalls.c)
> > and if you check core.txt/1 you will see no pgrbwt, only newnfs
> > ...
> > 
> > danny
> > 
> > > rick
> > > 
> > > > thanks,
> > > > 	danny
> > > > 
> > > > > rick
> > > > > 
> > > > > > my gut says it's running out of resources - mainly network
> > > > > > related,
> > > > > > but
> > > > > > can't pinpoint it.
> > > > > > 
> > > > > > any help will be most welcome
> > > > > > 
> > > > > > cheers,
> > > > > > 	danny
> > > > > > 
> > > > > > 
> > > > > > 
> > > > > > 
> > > > > > 
> > > > 
> > > > 
> > > > _______________________________________________
> > > > freebsd-stable@freebsd.org mailing list
> > > > http://lists.freebsd.org/mailman/listinfo/freebsd-stable
> > > > To unsubscribe, send any mail to
> > > > "freebsd-stable-unsubscribe@freebsd.org"
> > > > 
> > 
> > 
> > 




