Skip site navigation (1)Skip section navigation (2)
Date:      Thu, 14 May 2009 09:17:53 -0700 (PDT)
From:      Chris Timmons <cwt@networks.cwu.edu>
To:        John Baldwin <jhb@freebsd.org>
Cc:        stable@freebsd.org, freebsd-stable@freebsd.org, Martin Sugioarto <nakal@web.de>
Subject:   Re: kernel trap 12 with interrupts disabled [bge0 on 7.2R]
Message-ID:  <20090514091410.H12558@n.cwu.edu>
In-Reply-To: <200905140916.40594.jhb@freebsd.org>
References:  <1696198956@web.de> <200905140916.40594.jhb@freebsd.org>

next in thread | previous in thread | raw e-mail | index | archive | help

Yesterday I updated a rock-solid machine (uptime hundreds of days) from 
7-stable circa July, 2008, to the latest stable.   I run Nessus on this 
machine, with about 60 concurrent scans.  It pushes the load average up as 
high as 20 for short periods of time, but overall is reasonably efficient.

I have never had the box become unresponsive, let alone crash, under any 
load scenario.

This morning, I ran my first scan on 7.2-stable, with Nessus 4.0.  It 
lasted about 30 seconds before:


Fatal trap 12: page fault while in kernel mode
cpuid = 2; apic id = 06
fault virtual address	= 0x1c
fault code		= supervisor read, page not present
instruction pointer	= 0x20:0xc07a4dac
stack pointer	        = 0x28:0xee156ad4
frame pointer	        = 0x28:0xee156ad8
code segment		= base 0x0, limit 0xfffff, type 0x1b
 			= DPL 0, pres 1, def32 1, gran 1
processor eflags	= interrupt enabled, resume, IOPL = 0
current process		= 5263 (nessusd)
trap number		= 12
panic: page fault
cpuid = 3
Uptime: 17h22m15s
Physical memory: 3826 MB
Dumping 329 MB: 314 298 282 266 250 234 218 202 186 170 154 138 122 106 90 
74 58 42 26 10
Dump complete
aac0: shutting down controller...done
Automatic reboot in 15 seconds - press a key on the console to abort
Rebooting...
cpu_reset: Stopping other CPUs

-c

On Thu, 14 May 2009, John Baldwin wrote:

> Given that that is a single bit set, it could possibly be due to bad RAM.
> Does your kernel have debug symbols?  If so, running 'l *0xffffffff80186249'
> (from the 'instruction pointer' line in the fault message) would be helpful.
>
>> fault code = supervisor write data, page not present
>> instruction pointer = 0x8:0xffffffff80186249
>> stack pointer = 0x10:0xffffffff8065f200
>> frame pointer = 0x10:0x36ee7f
>> code segment = base 0x0, limit 0xfffff, type 0x1b
>>              = DPL 0, pres 1, long 1, def32 0, gran 1
>> processor eflags = resume, IOPL = 0
>> current process = 26 (irq256: bge0)
>> trap number = 12
>> p[*CURSOR STOPPED HERE*]
>
> -- 
> John Baldwin
> _______________________________________________
> freebsd-stable@freebsd.org mailing list
> http://lists.freebsd.org/mailman/listinfo/freebsd-stable
> To unsubscribe, send any mail to "freebsd-stable-unsubscribe@freebsd.org"
>



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20090514091410.H12558>