From: J David
To: Rick Macklem
Date: Tue, 20 Aug 2013 00:40:24 -0400
Subject: Re: NFS deadlock on 9.2-Beta1
In-Reply-To: <461392652.9990692.1376602743970.JavaMail.root@uoguelph.ca>
Cc:
 Konstantin Belousov, scottl, freebsd-stable, Michael Tratz, Steven Hartland

On Thu, Aug 15, 2013 at 5:39 PM, Rick Macklem wrote:
> Have you been able to pass the debugging info on to Kostik?
>
> It would be really nice to get this fixed for FreeBSD9.2.

You're probably not talking to me, but headway here is slow.

At our location, we have been continuing to test releng/9.2
extensively, but with r250907 reverted. Since reverting it solves the
issue, and since there haven't been any further changes to releng/9.2
that might also resolve this issue, re-applying r250907 is perceived
here as un-fixing a problem. Enthusiasm for doing so is
correspondingly low, even if the purpose is to gather debugging info.
:(

However, we finally got clearance to test releng/9.2 r254540 with
r250907 included and with DDB on five nodes. The problem cropped up
in about an hour. Two threads in one process deadlocked, which was
perfect. I got the machine into DDB, but the stack trace scrolled off
the screen, so there was no way to copy it by hand. Also, the
machine's disk is smaller than physical RAM, so there is no dump
file. :(

Here's what is available so far:

db> show proc 33362
Process 33362 (httpd) at 0xcd225b50:
 state: NORMAL
 uid: 25000  gids: 25000
 parent: pid 25104 at 0xc95f92d4
 ABI: FreeBSD ELF32
 arguments: /usr/local/libexec/httpd
 threads: 3
100405    D    newnfs    0xc9b875e4    httpd
100393    D    pgrbwt    0xc43a30c0    httpd
100755    S    uwait     0xc84b7c80    httpd

Not much to go on. :( Maybe these five nodes can be configured with
serial consoles.

So, inquiries are continuing, but the answer to "does this still
happen on 9.2-RC2?" is definitely yes.

Thanks!