From owner-freebsd-stable@FreeBSD.ORG Tue Apr 24 19:23:36 2007 Return-Path: X-Original-To: freebsd-stable@freebsd.org Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [69.147.83.52]) by hub.freebsd.org (Postfix) with ESMTP id 19B9F16A402 for ; Tue, 24 Apr 2007 19:23:36 +0000 (UTC) (envelope-from oleg@vsi.ru) Received: from serv2.vsi.ru (serv2.vsi.ru [80.82.32.11]) by mx1.freebsd.org (Postfix) with ESMTP id 8CE0513C43E for ; Tue, 24 Apr 2007 19:23:35 +0000 (UTC) (envelope-from oleg@vsi.ru) Received: from serv2.vsi.ru (localhost [127.0.0.1]) by serv2.vsi.ru (8.13.8/8.13.8) with ESMTP id l3OJNNMl017670; Tue, 24 Apr 2007 23:23:24 +0400 (MSD) (envelope-from oleg@vsi.ru) Received: (from nobody@localhost) by serv2.vsi.ru (8.13.8/8.13.8/Submit) id l3OJN5Xw017662; Tue, 24 Apr 2007 23:23:05 +0400 (MSD) (envelope-from oleg@vsi.ru) X-Authentication-Warning: serv2.vsi.ru: nobody set sender to oleg@vsi.ru using -f To: LI Xin Message-ID: <1177442585.462e5919c71f0@webmail.vsi.ru> Date: Tue, 24 Apr 2007 23:23:05 +0400 (MSD) From: Oleg Derevenetz References: <20070313140848.GA89182@steerpike.hanley.stade.co.uk> <20070423025631.GA33256@steerpike.hanley.stade.co.uk> <20070423113912.GE2052@deviant.kiev.zoral.com.ua> <462DDB4D.8080507@delphij.net> In-Reply-To: <462DDB4D.8080507@delphij.net> MIME-Version: 1.0 Content-Type: text/plain; charset=koi8-r Content-Transfer-Encoding: 8bit User-Agent: IMP/PHP IMAP webmail program 2.2.8 X-Originating-IP: 80.82.33.58 Cc: freebsd-stable@freebsd.org Subject: Re: 6.2-STABLE deadlock? X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 24 Apr 2007 19:23:36 -0000 Цитирую LI Xin : > Kostik Belousov wrote: > > On Mon, Apr 23, 2007 at 03:56:32AM +0100, Adrian Wontroba wrote: > >> On Tue, Mar 13, 2007 at 02:08:48PM +0000, Adrian Wontroba wrote: > >>> At work, amoungst my stable of old computers running FreeBSD, I have > a > >>> Fujitsu M800 - a 4 Zeon SMP processor with 4 GB of memory. This > >>> primarily runs Nagios and a small and lightly used MySQL database, > along > >>> with a few inbound FTP transfers per minute. It has a Mylex card > based > >>> disc subsystem, ruling out crash dumps. > >>> > >>> At some point during 5.5-STABLE this machine started to occasionally > hang ... > >> Another 6-STABLE (cvsupped on 27/03/07) example, with diagnostics > taken > >> rather sooner after the hang. Processes with wmesg=ufs feature often > in > >> the ps output. > >> > >> http://www.stade.co.uk/crash1/ > > > > I would suspect the mlx controller. There is several processes (for > instance, > > 988, 50918) waiting for completion of block read, and processes in the > "ufs" > > states are the result of the lock cascade, IMHO. > > I'm not very sure if this is specific to one disk controller. Actually > I got some occasional reports about similar hangs on amd64 6.2-RELEASE > (slightly patched version) that most of processes stuck in the 'ufs' > state, under very light load, the box was equipped with amr(4) RAID. > > I was not able to reproduce the problem at my lab, though, it's still > unknown that how to trigger the livelock :-( Still need some > investigate on their production system. I reported simular issue for FreeBSD 6.2 in audit-trail for kern/104406: http://www.freebsd.org/cgi/query-pr.cgi?pr=104406&cat= and there should be a thread related to this. Briefly, I suspects that this is related to nullfs filesystems on my server and when I cvsuped to FreeBSD 6.2- STABLE with Daichi's unionfs-related patches and replaced nullfs-mounted fs with unionfs-mounted (that was done 10.03.07) problem is gone (seems to be so, at least).