From owner-freebsd-stable@FreeBSD.ORG Fri Feb 13 15:40:10 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id D779010656CE for ; Fri, 13 Feb 2009 15:40:10 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from cyrus.watson.org (cyrus.watson.org [65.122.17.42]) by mx1.freebsd.org (Postfix) with ESMTP id 959628FC21 for ; Fri, 13 Feb 2009 15:40:10 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from server.baldwin.cx (pool-98-109-39-197.nwrknj.fios.verizon.net [98.109.39.197]) by cyrus.watson.org (Postfix) with ESMTPSA id 3DAEF46B5C; Fri, 13 Feb 2009 10:40:10 -0500 (EST) Received: from localhost (john@localhost [127.0.0.1]) (authenticated bits=0) by server.baldwin.cx (8.14.3/8.14.3) with ESMTP id n1DFd49P044500; Fri, 13 Feb 2009 10:40:04 -0500 (EST) (envelope-from jhb@freebsd.org) From: John Baldwin To: freebsd-stable@freebsd.org Date: Fri, 13 Feb 2009 10:18:37 -0500 User-Agent: KMail/1.9.7 References: In-Reply-To: MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit Content-Disposition: inline Message-Id: <200902131018.37940.jhb@freebsd.org> X-Greylist: Sender succeeded SMTP AUTH authentication, not delayed by milter-greylist-2.0.2 (server.baldwin.cx [127.0.0.1]); Fri, 13 Feb 2009 10:40:04 -0500 (EST) X-Virus-Scanned: ClamAV 0.94.2/8987/Fri Feb 13 05:32:33 2009 on server.baldwin.cx X-Virus-Status: Clean X-Spam-Status: No, score=-4.4 required=4.2 tests=ALL_TRUSTED,AWL,BAYES_00 autolearn=ham version=3.1.3 X-Spam-Checker-Version: SpamAssassin 3.1.3 (2006-06-01) on server.baldwin.cx Cc: Charles Sprickman Subject: Re: 7.1 Panic on degraded disk w/mpt X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 13 Feb 2009 15:40:11 -0000 On Monday 09 February 2009 1:13:08 am Charles Sprickman wrote: > Howdy, > > I dug around and can't find a PR on this, and the only other report I saw > was in this mailing list post that has no replies: > > http://www.nabble.com/7.1-BETA2-panic-on-mpt-degrade-td20183173.html > > The hardware is a Dell PowerEdge 860 with the Dell/LSI SAS5 controller: > mpt0: port 0xec00-0xecff mem > 0xfe9fc000-0xfe9fffff,0xfe9e0000-0xfe9effff irq 16 at device 8.0 on pci2 > mpt0: MPI Version=1.5.13.0 > > The panic is repeatable by forcing the array into a degraded state. > > Here's my best shot at getting info out of kgdb: > > [root@uniweb /home/spork]# cd /usr/obj/usr/src/sys/BWAY7/ > [root@uniweb /usr/obj/usr/src/sys/BWAY7]# kgdb kernel.debug > /var/crash/vmcore.0 GNU gdb 6.1.1 [FreeBSD] > Copyright 2004 Free Software Foundation, Inc. > GDB is free software, covered by the GNU General Public License, and you > are welcome to change it and/or distribute copies of it under certain > conditions. > Type "show copying" to see the conditions. > There is absolutely no warranty for GDB. Type "show warranty" for > details. > This GDB was configured as "i386-marcel-freebsd"... > > Unread portion of the kernel message buffer: > > > Fatal trap 12: page fault while in kernel mode > cpuid = 0; apic id = 00 > fault virtual address = 0x14 > fault code = supervisor read, page not present > instruction pointer = 0x20:0xc044b09b > stack pointer = 0x28:0xe6ee5b80 > frame pointer = 0x28:0xe6ee5b9c > code segment = base 0x0, limit 0xfffff, type 0x1b > = DPL 0, pres 1, def32 1, gran 1 > processor eflags = interrupt enabled, resume, IOPL = 0 > current process = 17 (swi2: cambio) > trap number = 12 > panic: page fault > cpuid = 0 > Uptime: 3m7s > Physical memory: 3575 MB > Dumping 94 MB: 79 63 47 31 15 > > Reading symbols from /boot/kernel/acpi.ko...Reading symbols from > /boot/kernel/acpi.ko.symbols...done. > done. > Loaded symbols for /boot/kernel/acpi.ko > #0 doadump () at pcpu.h:196 > 196 __asm __volatile("movl %%fs:0,%0" : "=r" (td)); > (kgdb) list *0xc044b09b > 0xc044b09b is in xpt_done (/usr/src/sys/cam/cam_xpt.c:4832). > 4827 if ((done_ccb->ccb_h.func_code & XPT_FC_QUEUED) != 0) { > 4828 /* > 4829 * Queue up the request for handling by our SWI handler > 4830 * any of the "non-immediate" type of ccbs. > 4831 */ > 4832 sim = done_ccb->ccb_h.path->bus->sim; > 4833 switch (done_ccb->ccb_h.path->periph->type) { > 4834 case CAM_PERIPH_BIO: > 4835 TAILQ_INSERT_TAIL(&sim->sim_doneq, &done_ccb->ccb_h, > 4836 sim_links.tqe); Can you 'p done_ccb->ccb_h.path' and if that is not 0, 'p done_ccb->ccb_h.path.bus'? -- John Baldwin