From owner-freebsd-stable@FreeBSD.ORG Wed Jul 15 13:48:24 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id C34A51065689; Wed, 15 Jul 2009 13:48:24 +0000 (UTC) (envelope-from ianjhart@ntlworld.com) Received: from mtaout02-winn.ispmail.ntl.com (mtaout02-winn.ispmail.ntl.com [81.103.221.48]) by mx1.freebsd.org (Postfix) with ESMTP id F1B558FC12; Wed, 15 Jul 2009 13:48:23 +0000 (UTC) (envelope-from ianjhart@ntlworld.com) Received: from aamtaout02-winn.ispmail.ntl.com ([81.103.221.35]) by mtaout02-winn.ispmail.ntl.com (InterMail vM.7.08.04.00 201-2186-134-20080326) with ESMTP id <20090715134819.TTVI6611.mtaout02-winn.ispmail.ntl.com@aamtaout02-winn.ispmail.ntl.com>; Wed, 15 Jul 2009 14:48:19 +0100 Received: from cpc1-cove3-0-0-cust909.sol2.cable.ntl.com ([86.20.31.142]) by aamtaout02-winn.ispmail.ntl.com (InterMail vG.2.02.00.01 201-2161-120-102-20060912) with ESMTP id <20090715134819.OTXP21638.aamtaout02-winn.ispmail.ntl.com@cpc1-cove3-0-0-cust909.sol2.cable.ntl.com>; Wed, 15 Jul 2009 14:48:19 +0100 X-Virus-Scanned: amavisd-new at omega.private.lan Received: from localhost (localhost [127.0.0.1]) by cpc1-cove3-0-0-cust909.sol2.cable.ntl.com (8.14.3/8.14.3) with ESMTP id n6FDlvLv006949; Wed, 15 Jul 2009 14:47:57 +0100 (BST) (envelope-from ianjhart@cpc1-cove3-0-0-cust909.sol2.cable.ntl.com) Received: from localhost (localhost [127.0.0.1]) by 10.248.192.16 (Horde Framework) with HTTP; Wed, 15 Jul 2009 14:47:56 +0100 Message-ID: <20090715144756.107076ji8zo2oh0k@10.248.192.16> Date: Wed, 15 Jul 2009 14:47:56 +0100 From: Ian J Hart To: John Baldwin References: <20090714145101.60515vh49s7o7c4k@10.248.192.16> <200907141300.16634.jhb@freebsd.org> In-Reply-To: <200907141300.16634.jhb@freebsd.org> MIME-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1; DelSp="Yes"; format="flowed" Content-Disposition: inline Content-Transfer-Encoding: 7bit User-Agent: Internet Messaging Program (IMP) 4.3.3 / FreeBSD-7.2 X-Spam-Status: No, score=-1.4 required=5.0 tests=ALL_TRUSTED autolearn=failed version=3.2.5 X-Spam-Checker-Version: SpamAssassin 3.2.5 (2008-06-10) on cpc1-cove3-0-0-cust909.sol2.cable.ntl.com X-Cloudmark-Analysis: v=1.0 c=1 a=6I5d2MoRAAAA:8 a=NLZqzBF-AAAA:8 a=pu2GmGBiriTffNN3MroA:9 a=ZeEn40FFaf4qDjDoOo0A:7 a=Kx3brIwj9YRegh4A7SqITyVILFgA:4 a=SV7veod9ZcQA:10 a=_dQi-Dcv4p4A:10 a=70Ui9NTFQqM8-blI:21 a=sXwrx7q-aWA_qakX:21 Cc: freebsd-stable@freebsd.org Subject: Re: trap 12 X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 15 Jul 2009 13:48:25 -0000 Quoting John Baldwin : > On Tuesday 14 July 2009 9:51:01 am Ian J Hart wrote: >> Quoting John Baldwin : >> >> > On Tuesday 07 July 2009 5:51:03 am Ian J Hart wrote: >> >> Quoting Ian J Hart : >> >> >> >>> Quoting Ian J Hart : >> >>> >> >>>> Is this likely to be hardware? Details will follow if not. >> >>>> >> >>>> [copied from a screen dump] >> >>>> >> >>>> Fatal trap 12: page fault while in kernel mode >> >>>> cpuid = 1; apic id = 01 >> >>>> fault virtual address = 0x0 >> >>>> fault code = supervisor write data, page not present >> >>>> instruction pointer = 0x8:0xffffffff807c6c12 >> >>>> stack pointer = 0x10:0xffffffff510e7890 >> >>>> frame pointer = 0x10:0xffffff00054a6c90 >> >>>> code segment = base 0x0, limit 0xfffff, type 0x1b >> >>>> = DPL 0, pres 1, long 1 def32 0, gran 1 >> >>>> processor eflags = interrupt enabled, resume, IOPL = 0 >> >>>> current process = 75372 (printf) >> >>>> trap number = 12 >> >>>> panic: page fault >> >>>> cpuid = 1 >> >>>> uptime: 8m2s >> >>>> Cannot dump. No dump device defined. >> >>>> >> >>>> >> >>> Ran crashinfo, now have much more info than I need ;) >> >>> >> >>> Starting another portupgrade run now to see how reproducable this is. >> >>> >> >>> Later BIOS waiting in USB floppy. >> >>> >> >> [snip dmesg] >> >> >> >> It took 2 runs of portupgrade -af.Some corruption in the dbs may have >> >> to pkg_delete -a. >> >> >> >> FreeBSD * 7.2-RELEASE-p1 FreeBSD 7.2-RELEASE-p1 #0: Tue Jun 16 >> >> 18:03:10 BST 2009 *@*:/usr/obj/usr/src/sys/GENERIC amd64 >> >> >> >> panic: page fault >> >> >> >> GNU gdb 6.1.1 [FreeBSD] >> >> Copyright 2004 Free Software Foundation, Inc. >> >> GDB is free software, covered by the GNU General Public License, and you > are >> >> welcome to change it and/or distribute copies of it under certain >> > conditions. >> >> Type "show copying" to see the conditions. >> >> There is absolutely no warranty for GDB. Type "show warranty" for > details. >> >> This GDB was configured as "amd64-marcel-freebsd"... >> >> >> >> Unread portion of the kernel message buffer: >> >> >> >> >> >> Fatal trap 12: page fault while in kernel mode >> >> cpuid = 1; apic id = 01 >> >> fault virtual address = 0xfffffffff5555570 >> >> fault code = supervisor write data, page not present >> >> instruction pointer = 0x8:0xffffffff807c429b >> >> stack pointer = 0x10:0xffffffff511e4710 >> >> frame pointer = 0x10:0x20 >> >> code segment = base 0x0, limit 0xfffff, type 0x1b >> >> = DPL 0, pres 1, long 1, def32 0, gran 1 >> >> processor eflags = interrupt enabled, resume, IOPL = 0 >> >> current process = 69996 (mkdir) >> >> trap number = 12 >> >> panic: page fault >> > >> > This one does look like a hardware issue from the stack trace. It's hard > to >> > know if the first panic you saw was a hardware issue as well without the >> > stack trace information. >> > >> >> #7 0xffffffff807b706e in calltrap () >> >> at /usr/src/sys/amd64/amd64/exception.S:209 >> >> #8 0xffffffff807c429b in free_pv_entry (pmap=0xffffffff80b66c80, >> >> pv=Variable "pv" is not available. >> >> ) >> >> at /usr/src/sys/amd64/amd64/pmap.c:1905 >> >> #9 0xffffffff807c4403 in pmap_remove_entry (pmap=Variable "pmap" is >> >> not available. >> >> ) >> >> at /usr/src/sys/amd64/amd64/pmap.c:2131 >> >> #10 0xffffffff807c6447 in pmap_remove_pte (pmap=0xffffffff80b66c80, >> >> ptq=0xaaaaaaa8, va=18446744070506639360, ptepde=23601251, >> >> free=0xffffffff511e4790) at /usr/src/sys/amd64/amd64/pmap.c:2366 >> >> #11 0xffffffff807cab87 in pmap_remove (pmap=0xffffffff80b66c80, >> >> sva=18446744070506639360, eva=18446744070506909696) >> >> at /usr/src/sys/amd64/amd64/pmap.c:2510 >> > >> > -- >> > John Baldwin >> > >> >> The remote backup continues to run so there was definitely some issue >> there. No more reboots, but it wasn't doing that regularly without >> some additional load. >> >> Hopefully I can swap parts around until I find the offending item. >> >> Thanks for your input. > > I would try running memtest86 to check your RAM. > > -- > John Baldwin > It's ECC and was extensively tested when new so I suspect it's something else, but I'll memtest it overnight anyway. Sat at a panic when I went to put the floppy in.No debugger prompt and unresponsive so I power cycled it :( -- ian j hart ---------------------------------------------------------------- This message was sent using IMP, the Internet Messaging Program.