From: John Baldwin
To: Peter Holm
Cc: freebsd-current@freebsd.org
Date: Wed, 25 Oct 2006 15:45:52 -0400
Subject: Re: Deadlock while testing on a 64 MB filesystem
Message-Id: <200610251545.53454.jhb@freebsd.org>
In-Reply-To: <20061025192238.GA25846@peter.osted.lan>

On Wednesday 25 October 2006 15:22, Peter Holm wrote:
> On Wed, Oct 25, 2006 at 01:33:41PM -0400, John Baldwin wrote:
> > On Monday 23 October 2006 04:48, Peter Holm wrote:
> > > After some prodding by phk@ I made a test for problems seen with
> > > "newfs -b 32768 -f 4096".
> > >
> > > http://people.freebsd.org/~pho/stress/log/cons218.html
> > >
> > > I was using these watchdog options: -t 900 -e 'ls /tmp /dev
> > > /mnt > /dev/null; true' -s 60 and /mnt was the mount point for the test
> > > filesystem.
> >
> > Looks like the root is held by 97785:
> >
> > 0xc61292a0: tag ufs, type VDIR
> >     usecount 35, writecount 0, refcount 39 mountedhere 0
> >     flags (VV_ROOT)
> >     v_object 0xc5c2f9d8 ref 0 pages 1
> >     lock type ufs: EXCL (count 1) by thread 0xc47ea6c0 (pid 97785) with 12 pending
> >
> > And it is blocked on another lockmgr lock:
> >
> > 97785 997 97785 1001 S+ getblk 0xd7f43588 ls
> >
> > lockmgr(d7f43588,202122,c6129368,c47ea6c0) at lockmgr+0x46e
> > getblk(c61292a0,0,0,1000,0,...) at getblk+0x12f
> > breadn(c61292a0,0,0,1000,0,...) at breadn+0x2f
> > bread(c61292a0,0,0,1000,0,...) at bread+0x20
> > ffs_read(e69f7bac) at ffs_read+0x23f
> >
> > Can you find the bp in getblk and print out the associated lock? I'm guessing
> > it's share locked by someone else?
> >
>
> I have updated cons218 with what I hope is the requested info.

Ah, it's got one of those "bogus" owners:

lk_lockholder = 0xfffffffe

aka LK_KERNPROC. I don't grok enough buf/bio stuff to dive into this further.
Maybe phk (cc'd) can help.

-- 
John Baldwin
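
For anyone reading the lk_lockholder value above, here is a minimal sketch of why 0xfffffffe is not a real thread pointer. It assumes a 32-bit kernel (the 0xc....... addresses in the trace suggest one) and assumes the sentinel owners are small negative integers cast to a thread pointer, with -2 standing for LK_KERNPROC and -1 for LK_NOPROC. The helper names as_pointer and describe_lockholder are made up for illustration and are not kernel interfaces.

```python
POINTER_BITS = 32  # assumption: 32-bit kernel, as the 0xc....... addresses suggest


def as_pointer(value, bits=POINTER_BITS):
    """Return the unsigned bit pattern of an integer cast to a pointer."""
    return value & ((1 << bits) - 1)


# Assumed sentinel values: small negative integers cast to a thread pointer.
SENTINELS = {
    as_pointer(-1): "LK_NOPROC",
    as_pointer(-2): "LK_KERNPROC",
}


def describe_lockholder(lk_lockholder):
    """Name a sentinel owner, or format the value as a real thread pointer."""
    return SENTINELS.get(lk_lockholder, "thread 0x%08x" % lk_lockholder)


if __name__ == "__main__":
    print(describe_lockholder(0xfffffffe))  # owner of the buffer lock
    print(describe_lockholder(0xc47ea6c0))  # thread holding the root vnode lock
```

If those assumptions hold, the buffer lock has no owning thread to look up, which is consistent with the trail going cold at the lock holder.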