From owner-freebsd-stable@FreeBSD.ORG Tue Jul 30 02:48:39 2013 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) (using TLSv1 with cipher ADH-AES256-SHA (256/256 bits)) (No client certificate requested) by hub.freebsd.org (Postfix) with ESMTP id 6AC90D2C; Tue, 30 Jul 2013 02:48:39 +0000 (UTC) (envelope-from jdavidlists@gmail.com) Received: from mail-ob0-x22c.google.com (mail-ob0-x22c.google.com [IPv6:2607:f8b0:4003:c01::22c]) (using TLSv1 with cipher ECDHE-RSA-RC4-SHA (128/128 bits)) (No client certificate requested) by mx1.freebsd.org (Postfix) with ESMTPS id 26DF52E07; Tue, 30 Jul 2013 02:48:39 +0000 (UTC) Received: by mail-ob0-f172.google.com with SMTP id uz6so7401382obc.31 for ; Mon, 29 Jul 2013 19:48:38 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:sender:in-reply-to:references:date :x-google-sender-auth:message-id:subject:from:to:cc:content-type; bh=W6NhDhNVjLNOd+KkPTQjoeA/dSYi6vg+ffqmSBznNIs=; b=STS9sL4bu4mMrVFLM6OCQ93OYkDo1E5y0UBrdN/tKwb2uOKJ2ggxNLK2AbH0qmIdsa okyvCKhaa4J4vdFvzL33GE2jukshh4fAXkF5Ovv2INT3+fHhZH3F9xPqB4UoJwDi12zw b6M6VP7xHDe6Wtgfzt7FUULRVEkvFvinKCwXO1D3Y6KK1H9D6TLAgMvlf+9MddLiPhbG Ruz7dT2KLDBt7icw62Jw7L65bB3L62SJpqRrOLCLvmXdKTz4fkC9XevGRRmzisASdP1M qlV6LVE9krzprmMjZvvTEHBr2lUe+XdSriYgifDwN+JkUGju2gKJxRCcX6FWDUoOrNqQ xoIQ== MIME-Version: 1.0 X-Received: by 10.42.215.11 with SMTP id hc11mr9160icb.9.1375152517946; Mon, 29 Jul 2013 19:48:37 -0700 (PDT) Sender: jdavidlists@gmail.com Received: by 10.42.114.73 with HTTP; Mon, 29 Jul 2013 19:48:37 -0700 (PDT) In-Reply-To: <1710471570.3603170.1375141032147.JavaMail.root@uoguelph.ca> References: <1710471570.3603170.1375141032147.JavaMail.root@uoguelph.ca> Date: Mon, 29 Jul 2013 22:48:37 -0400 X-Google-Sender-Auth: WE67Cdk9uHIcpd-k59zue2OmcJ0 Message-ID: Subject: Re: NFS deadlock on 9.2-Beta1 From: J David To: Rick Macklem Content-Type: text/plain; charset=ISO-8859-1 Cc: Konstantin Belousov , Steven Hartland , freebsd-stable@freebsd.org, re , Michael Tratz X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 30 Jul 2013 02:48:39 -0000 If it is helpful, we have 25 nodes testing the 9.2-BETA1 build and without especially trying to exercise this bug, we found sendfile()-using processes deadlocked in WCHAN newnfs on 5 of the 25 nodes. The ones with highest uptime (about 3 days) seem most affected, so it does seem like a "sooner or later" type of thing. Hopefully the fix is easy and it won't be an issue, but it definitely does seem like a problem 9.2-RELEASE would be better off without. Unfortunately we are not in a position to capture the requested debugging information at this time; none of those nodes are running a debug version of the kernel. If Michael is unable to get the information as he hopes, we can try to do that, possibly over the weekend. For the time being, we will convert half the machines to rollback r250907 to try to confirm that resolves the issue. Thanks all! If one has to encounter a problem like this, it is nice to come to the list and find the research already so well underway!