From owner-freebsd-stable@FreeBSD.ORG Fri Feb 26 13:34:01 2010 Return-Path: Delivered-To: stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 436BB1065670 for ; Fri, 26 Feb 2010 13:34:01 +0000 (UTC) (envelope-from danny@cs.huji.ac.il) Received: from kabab.cs.huji.ac.il (kabab.cs.huji.ac.il [132.65.16.84]) by mx1.freebsd.org (Postfix) with ESMTP id E89C48FC20 for ; Fri, 26 Feb 2010 13:34:00 +0000 (UTC) Received: from pampa.cs.huji.ac.il ([132.65.80.32]) by kabab.cs.huji.ac.il with esmtp id 1Nkzry-0000AC-0S; Fri, 26 Feb 2010 15:04:38 +0200 X-Mailer: exmh version 2.7.2 01/07/2005 with nmh-1.2 To: Gerrit =?ISO-8859-1?Q?K=FChn?= In-reply-to: <20100226134429.041ea6f2.gerrit@pmp.uni-hannover.de> References: <4B86F384.3010308@digiware.nl> <2a41acea1002251459v40e8c6ddxd0437decbada4594@mail.gmail.com> <4B8795B1.4020006@digiware.nl> <20100226120339.GB17798@icarus.home.lan> <20100226133138.d47dd080.gerrit@pmp.uni-hannover.de> <20100226134429.041ea6f2.gerrit@pmp.uni-hannover.de> Comments: In-reply-to Gerrit =?ISO-8859-1?Q?K=FChn?= message dated "Fri, 26 Feb 2010 13:44:29 +0100." Mime-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Transfer-Encoding: quoted-printable Date: Fri, 26 Feb 2010 15:04:37 +0200 From: Daniel Braniss Message-ID: Cc: stable@freebsd.org, Willem Jan Withagen , Jack Vogel , Jeremy Chadwick Subject: Re: em0 freezes on ZFS server X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 26 Feb 2010 13:34:01 -0000 > On Fri, 26 Feb 2010 13:31:38 +0100 Gerrit K=FChn > wrote about Re: em0 freezes on ZFS serve= r: >=20 > GK> JC> Note how close the =22current=22 value is to that of =22total= =22. I'm not > GK> JC> too surprised you're seeing what you are as a result of this. > GK> JC> What on earth is this machine doing at all times? >=20 > GK> Is there any way I could find out what is actually using these buff= ers? >=20 > Sorry for replying to my own email: > At least in my case I found out what is eating the buffers: nfsd does= =21 > The buffers stop increasing as soon as I stop nfsd. However, they start= > increasing as soon as I start nfsd again. > Are there any ideas how to fix this? Downgrading back to 7-stable is no= t > really an easy task as far as I know, and I need the server to run with= out > having to reboot it once for twice a day... I want to add some spices to this stew: :-) I have this big server (> 10 TB) which was running pretty much without ma= jor problems, till one morning it started panicking because some 'ZFS * crede= ntial *', Since this server is used by many and uptime being a priority, I upgraded it to 8-stable, the panic went away, one problem solved. Some few day later it hung, and it's now hanging every few days. Most of the hangs are because there is no network, but the NIC is bce not= em=21 I doubled kern.ipc.nmbclusters and lets see what happens ... netstat -m: 23066/6634/29700 mbufs in use (current/cache/total) 22072/5942/28014/51200 mbuf clusters in use (current/cache/total/max) 22021/2939 mbuf+clusters out of packet secondary zone in use (current/cac= he) hope this helps in finding a cure, danny