Date: Sat, 9 May 2009 19:43:16 -0300 (ADT)
From: "Marc G. Fournier"
To: Gavin Atkinson
Cc: Martin Schmidt, freebsd-stable@FreeBSD.org
Subject: RE: 7.1-STABLE Sun Mar 29 01:06:46 ADT 2009 Locks up ...
Message-ID: <20090509193937.T10574@hub.org>
In-Reply-To: <1240920035.85945.7.camel@buffy.york.ac.uk>
References: <1240920035.85945.7.camel@buffy.york.ac.uk>

On Tue, 28 Apr 2009, Gavin Atkinson wrote:

> On Fri, 2009-04-24 at 20:39 +0200, Martin Schmidt wrote:
>> Hi Marc and List,
>>
>> i had similar issues with FreeBSD 7.2-PRERELEASE. Server (zfs,nfs)
>> seems to hang in intervals of about 8 hours.
>> kernel is still there but no connections can be made to nfs/ssh and
>> login on local console doesn't seem to
>> work due to incredible slowness. breaking to the debugger takes a
>> moment but works.
>> (compiling kernel with WITNESS didnt help)
>>
>> the server had been solid before with 7 stable kernel from around 19
>> October 2008.
>>
>> I now added these lines to /boot/loader.conf
>>
>> hw.pci.enable_msi=0
>> hw.pci.enable_msix=0
>>
>> to disable Message Signaled Interrupts. Which are used by the 3ware
>> twa driver and igb network driver on our server.
>
> If you are willing to test further on your server, it may be helpful if
> you could determine which of those two lines in loader.conf fixes the
> problem for you. It would also be useful to provide a dmesg from the
> machine when both msi and msix are enabled.
>
> FWIW, looking at the "vmstat -i" output it appears that only the igb
> driver that are using MSI/MSIX, unless you have a reason to suspect
> otherwise?

How do you tell that about igb? Looking at the server I have the igb
device on, vmstat -i doesn't seem to say anything about MSI/MSIX ...

# vmstat -i
interrupt                  total    rate
irq1: atkbd0                 162       0
irq30: twa0            402647215     187
cpu0: timer           4284778818    1999
irq256: igb0          1282945461     598
irq257: igb0           215507100     100
irq258: igb0           417702261     194
irq259: igb0           314601966     146
irq260: igb0           568062067     265
irq261: igb0                   3       0
cpu5: timer           4284744445    1999
cpu6: timer           4284731466    1999
cpu7: timer           4284724508    1999
cpu1: timer           4284893874    1999
cpu3: timer           4284899807    1999
cpu2: timer           4284892325    1999
cpu4: timer           4284897264    1999
Total                37480028742   17493

On the server(s) that I am experiencing the hangs on, vmstat -i shows:

# vmstat -i
interrupt                  total    rate
irq1: atkbd0                   2       0
irq3: sio1                     8       0
irq25: bge0              4614816     213
irq72: ciss0             1835763      85
cpu0: timer             43113685    1997
cpu1: timer             43116889    1997
Total                   92681163    4293

Are any of these similarly using MSI/MSIX?

----
Marc G. Fournier           Hub.Org Networking Services (http://www.hub.org)
Email . scrappy@hub.org                              MSN . scrappy@hub.org
Yahoo . yscrappy               Skype: hub.org        ICQ . 7615664
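
For what it's worth, one way to read the vmstat -i listings above is by IRQ number. The sketch below is only an illustration and rests on an assumption that is not stated anywhere in this thread: that on FreeBSD/x86 of this era, MSI and MSI-X vectors are numbered from 256 upward, while lower numbers are ordinary pin-based interrupts. The names classify_interrupts, FIRST_MSI_IRQ and SAMPLE are made up for the example, and the sample rows are copied from both listings. If the assumption holds, igb0 (irq256 and up) would be on MSI/MSI-X, and twa0, bge0 and ciss0 would not.

```python
import re

# ASSUMPTION (not from the thread): MSI/MSI-X vectors start at IRQ 256
# on FreeBSD/x86 of this vintage; anything lower is a legacy pin interrupt.
FIRST_MSI_IRQ = 256

# Matches rows such as "irq256: igb0   1282945461   598".
IRQ_LINE = re.compile(r"^irq(\d+):\s+(\S+)\s+(\d+)\s+(\d+)\s*$")


def classify_interrupts(vmstat_output):
    """Map each device in `vmstat -i` output to the interrupt kinds it uses."""
    kinds = {}
    for line in vmstat_output.splitlines():
        match = IRQ_LINE.match(line.strip())
        if match is None:
            continue  # header, "cpuN: timer" rows and the Total row
        irq, device = int(match.group(1)), match.group(2)
        kind = "MSI/MSI-X" if irq >= FIRST_MSI_IRQ else "legacy"
        kinds.setdefault(device, set()).add(kind)
    return kinds


# Rows copied from the two listings in the message.
SAMPLE = """\
interrupt                  total    rate
irq1: atkbd0                 162       0
irq30: twa0            402647215     187
cpu0: timer           4284778818    1999
irq256: igb0          1282945461     598
irq257: igb0           215507100     100
irq25: bge0              4614816     213
irq72: ciss0             1835763      85
"""

if __name__ == "__main__":
    for device, kind in sorted(classify_interrupts(SAMPLE).items()):
        print(device, ", ".join(sorted(kind)))
```

The sketch only handles rows with a single device name per IRQ. A dmesg from the machine, as Gavin asked for, would be a more direct confirmation than the IRQ number alone.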