Skip site navigation (1)Skip section navigation (2)
Date:      Tue, 24 Apr 2007 18:26:21 +0800
From:      LI Xin <delphij@delphij.net>
To:        Kostik Belousov <kostikbel@gmail.com>
Cc:        freebsd-stable@freebsd.org
Subject:   Re: 6.2-STABLE deadlock?
Message-ID:  <462DDB4D.8080507@delphij.net>
In-Reply-To: <20070423113912.GE2052@deviant.kiev.zoral.com.ua>
References:  <20070313140848.GA89182@steerpike.hanley.stade.co.uk>	<20070423025631.GA33256@steerpike.hanley.stade.co.uk> <20070423113912.GE2052@deviant.kiev.zoral.com.ua>

next in thread | previous in thread | raw e-mail | index | archive | help
This is an OpenPGP/MIME signed message (RFC 2440 and 3156)
--------------enig91AADDB1596CBF4AD608A17F
Content-Type: text/plain; charset=ISO-8859-1
Content-Transfer-Encoding: quoted-printable

Kostik Belousov wrote:
> On Mon, Apr 23, 2007 at 03:56:32AM +0100, Adrian Wontroba wrote:
>> On Tue, Mar 13, 2007 at 02:08:48PM +0000, Adrian Wontroba wrote:
>>> At work, amoungst my stable of old computers running FreeBSD, I have =
a
>>> Fujitsu M800 - a 4 Zeon SMP processor with 4 GB of memory. This
>>> primarily runs Nagios and a small and lightly used MySQL database, al=
ong
>>> with a few inbound FTP transfers per minute. It has a Mylex card base=
d
>>> disc subsystem, ruling out crash dumps.
>>>
>>> At some point during 5.5-STABLE this machine started to occasionally =
hang ...
>> Another 6-STABLE (cvsupped on 27/03/07) example, with diagnostics take=
n
>> rather sooner after the hang.  Processes with wmesg=3Dufs feature ofte=
n in
>> the ps output.
>>
>> http://www.stade.co.uk/crash1/
>=20
> I would suspect the mlx controller. There is several processes (for ins=
tance,
> 988, 50918) waiting for completion of block read, and processes in the =
"ufs"
> states are the result of the lock cascade, IMHO.

I'm not very sure if this is specific to one disk controller.  Actually
I got some occasional reports about similar hangs on amd64 6.2-RELEASE
(slightly patched version) that most of processes stuck in the 'ufs'
state, under very light load, the box was equipped with amr(4) RAID.

I was not able to reproduce the problem at my lab, though, it's still
unknown that how to trigger the livelock :-(  Still need some
investigate on their production system.

Cheers,
--=20
Xin LI <delphij@delphij.net>	http://www.delphij.net/
FreeBSD - The Power to Serve!


--------------enig91AADDB1596CBF4AD608A17F
Content-Type: application/pgp-signature; name="signature.asc"
Content-Description: OpenPGP digital signature
Content-Disposition: attachment; filename="signature.asc"

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.7 (Darwin)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org

iD8DBQFGLdtNOfuToMruuMARCnabAJ94rde5il3J+4nL5sSOIfoLvH/O3wCeJgF9
nVv3N1lZcz/5mI5SmueCSxw=
=HnXN
-----END PGP SIGNATURE-----

--------------enig91AADDB1596CBF4AD608A17F--



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?462DDB4D.8080507>