Skip site navigation (1)Skip section navigation (2)
Date:      Mon, 15 Sep 2008 16:06:23 -0400
From:      John Baldwin <jhb@freebsd.org>
To:        freebsd-stable@freebsd.org
Cc:        Tim Chen <gphoto6@gmail.com>
Subject:   Re: Suddenly frozen fcntl/stat call on NFS over TCP with MTU 9000
Message-ID:  <200809151606.23933.jhb@freebsd.org>
In-Reply-To: <1f51039c0809150857l50b6be8eu848e21189a4175d6@mail.gmail.com>
References:  <1f51039c0809150857l50b6be8eu848e21189a4175d6@mail.gmail.com>

next in thread | previous in thread | raw e-mail | index | archive | help
On Monday 15 September 2008 11:57:02 am Tim Chen wrote:
> Currently I was running a mail server using a netapp filer as backend
> storage.
> >From time to time, the whole system get stuck and lasted for 3-5 minutes.
> But
> after that, everything recovers normally. During the "stuck" moment, using
> ps
> auxw shows 200-300 of mail delivery agent(MDA) processes staying in "D"
> status.
> The command df certainly does not reponse either.

Can you use 'ps axl' to determine the wait mesg ("wchan") of the stuck threads 
when they hang?  If it is "lockf", then make sure you have an up-to-date 
RELENG_6 kernel as there was a recent fix for a "lockf" hang.

Alternatively, if things are stuck in "nfsreq", it may be useful to use 
tcpdump to look at the NFS requests your client is making.  nfsstat can also 
be useful as you can see which counters are increasing during a hang.

-- 
John Baldwin



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?200809151606.23933.jhb>