Date: Thu, 7 May 2009 17:50:12 +0200
From: Riccardo Torrini
To: freebsd-stable@freebsd.org
Cc: siedar@nplay.pl, scottl@freebsd.org, jhb@freebsd.org
Subject: kern/130330: [mpt] [panic] Panic and reboot machine MPT ...
Message-ID: <20090507155012.GW21112@tiger.fi.esaote.it>

I just submitted a follow-up to PR kern/130330 with the same information.
I may have found the commit that causes the crash. Please see the PR for
more details (and please cc me on replies to this thread).

I narrowed the time window of the problem by running (a lot of) build and
install world cycles on sources dated from 2008.07 up to now (read: last
week). With 2008.07.28.17.00.00 (7.0-STABLE) everything works fine, but
with 2008.07.28.18.00.00 the machine starts crashing when I remove the
second disk of a mirror (while the mirror is healthy) or add the second
disk to a degraded one. Note that the same crash also happens with every
7.1 (stable or release) and every 7.2-PRE I tested.

(wrapping long lines)

# cd /home/ncvs/src/sys/
# grep -R "date.*2008\.07\.28\.17" ./ | grep -v /Attic
./dev/wi/if_wi.c,v:
    date 2008.07.28.17.00.37; author imp; state Exp;
./dev/wi/if_wivar.h,v:
    date 2008.07.28.17.00.37; author imp; state Exp;
./dev/mpt/mpt_raid.c,v:
    date 2008.07.28.17.10.09; author jhb; state Exp;
./dev/mpt/mpt_raid.c,v:
    date 2008.07.28.17.05.09; author jhb; state Exp;
./kern/sched_4bsd.c,v:
    date 2008.07.28.17.25.24; author jhb; state Exp;
./modules/et/Makefile,v:
    date 2008.07.28.17.56.37; author antoine; state Exp;

In that time window only 4 revisions touch src/sys/dev (two of them in
mpt_raid.c), and my bet is on mpt_raid.c :-)

This is the commit log extracted from cvsweb:

-----8<-----
Revision 1.15.2.1: Mon Jul 28 17:05:09 2008 UTC (9 months, 1 week ago) by jhb
Branches: RELENG_7
CVS tags: RELENG_7_1_BP
Branch point for: RELENG_7_1
Diff to: previous 1.15: preferred, colored
Changes since revision 1.15: +4 -4 lines

SVN rev 180920 on 2008-07-28 17:05:09Z by jhb

MFC: Allocate a single CCB at the start of the main loop of the RAID
monitoring kthread of the mpt(4) driver.
-----8<-----

Here is the diff:
http://www.freebsd.org/cgi/cvsweb.cgi/src/sys/dev/mpt/mpt_raid.c.diff?r1=1.15;r2=1.15.2.1

What can I do now?

-- 
Riccardo.
Network Manager @ ESAOTE S.p.A.