Skip site navigation (1)Skip section navigation (2)
Date:      Wed, 19 Jul 2006 11:49:11 -0300 (ADT)
From:      User Freebsd <freebsd@hub.org>
To:        Kostik Belousov <kostikbel@gmail.com>
Cc:        freebsd-stable@freebsd.org, Robert Watson <rwatson@freebsd.org>
Subject:   Re: file system deadlock - the whole story?
Message-ID:  <20060719114751.X1799@ganymede.hub.org>
In-Reply-To: <20060719144305.GM1464@deviant.kiev.zoral.com.ua>
References:  <20060705100403.Y80381@fledge.watson.org> <cone.1152136419.991036.72616.1000@zoraida.natserv.net> <20060705234514.I70011@fledge.watson.org> <20060715000351.U1799@ganymede.hub.org> <20060715035308.GJ32624@deviant.kiev.zoral.com.ua> <20060718074804.W1799@ganymede.hub.org> <20060719112424.GK1464@deviant.kiev.zoral.com.ua> <20060719082627.H1799@ganymede.hub.org> <20060719151327.H5132@fledge.watson.org> <20060719112208.Y1799@ganymede.hub.org> <20060719144305.GM1464@deviant.kiev.zoral.com.ua>

next in thread | previous in thread | raw e-mail | index | archive | help
On Wed, 19 Jul 2006, Kostik Belousov wrote:

> On Wed, Jul 19, 2006 at 11:23:21AM -0300, User Freebsd wrote:
>> On Wed, 19 Jul 2006, Robert Watson wrote:
>>
>>>
>>> On Wed, 19 Jul 2006, User Freebsd wrote:
>>>
>>>> Also note that under FreeBSD 4.x, all three of these machines were pretty
>>>> much my more solid machines, with even more vServers running on them then
>>>> I'm able to run with 6.x ... once I got rid of using unionfs, stability
>>>> skyrocketed :(
>>>>
>>>> Hrmmmm ... but, your 'controller driver' comment ... that is one common
>>>> thing amongst all three servers ... they are all running the iir driver
>>>> ... not sure the *exact* controller, but pluto (older Dual-PIII) shows it
>>>> as:
>>>
>>> Yes, this was going to be my next question -- if you're seeing wedges
>>> under load and there's a common controller in use, maybe we're looking at
>>> a driver bug.  Bugs of those sort typically look a lot like what you
>>> describe: an I/O is "lost" and so eveything that depends on the I/O wedges
>>> waiting for it, leading to a lot of processes hanging around waiting for
>>> vnode locks, etc.
>>
>> 'k, but how do we debug *that*? :(  If it was one, I'd suspect hardware
>> ... but *three*, and only acting up *after* upgrading to FreeBSD 6.x, and
>> only acting up under load ...
>
> Obvious step would be to replace controller by some different kind.

Unfortunately, that one isn't an option ... these aren't local machines 
that I can easily swap hardware in :(

----
Marc G. Fournier           Hub.Org Networking Services (http://www.hub.org)
Email . scrappy@hub.org                              MSN . scrappy@hub.org
Yahoo . yscrappy               Skype: hub.org        ICQ . 7615664



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20060719114751.X1799>