Skip site navigation (1)Skip section navigation (2)
Date:      Thu, 26 Jul 2007 12:11:49 +0100 (BST)
From:      Robert Watson <rwatson@FreeBSD.org>
To:        Kris Kennaway <kris@obsecurity.org>
Cc:        arch@FreeBSD.org, Anders Nordby <anders@FreeBSD.org>, jkim@FreeBSD.org
Subject:   Re: Removing NET_NEEDS_GIANT: first patch
Message-ID:  <20070726120713.Q15979@fledge.watson.org>
In-Reply-To: <20070726105358.GA43979@rot26.obsecurity.org>
References:  <20070724110908.T83919@fledge.watson.org> <20070726102328.GA12293@fupp.net> <20070726105358.GA43979@rot26.obsecurity.org>

next in thread | previous in thread | raw e-mail | index | archive | help
On Thu, 26 Jul 2007, Kris Kennaway wrote:

>> I've used and still use debug_mpsafenet to get rid of watchdog timeout 
>> problems on a lot of HP Proliant servers, particularly with the bge driver:
>>
>> Dec 21 06:42:51 videovm1 kernel: bge0: watchdog timeout -- resetting
>> Dec 21 06:42:51 videovm1 kernel: bge0: link state changed to DOWN
>> Dec 21 06:42:54 videovm1 kernel: bge0: link state changed to UP
>>
>> This problem goes away with debug.mpsafenet="0", for me.
>>
>> I can try to turn off this setting, and see how it goes. I remember there 
>> was something one could do, to get more information about the watchdog 
>> error, but can't remember what.
>
> Please do.  There is no sense in crippling your network for the sake of an 
> unre{solved,ported} driver bug.

I agree with what Kris said, only more so. :-)  By masking the bug using 
debug.mpsafenet, whatever bug is the root of the problem isn't getting fixed, 
and instead keeps going out in releases.  This sounds like it's most likely a 
driver bug, although I wouldn't rule out some sort of interrupt problem.  It 
looks like jkim might be someone to talk to about this (CC'd).

Robert N M Watson
Computer Laboratory
University of Cambridge



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20070726120713.Q15979>