Date:      Tue, 8 Aug 2000 16:22:26 +0000
From:      dl@leo.org
To:        FreeBSD-gnats-submit@freebsd.org
Subject:   kern/20484: FreeBSD 4.0 crashes repeatedly: trap 12: page fault while in kernel mode
Message-ID:  <20000808142225.2B98F17408@atleo3.leo.org>


>Number:         20484
>Category:       kern
>Synopsis:       FreeBSD 4.0 crashes repeatedly: trap 12: page fault while in kernel mode
>Confidential:   no
>Severity:       critical
>Priority:       medium
>Responsible:    freebsd-bugs
>State:          open
>Quarter:        
>Keywords:       
>Date-Required:
>Class:          sw-bug
>Submitter-Id:   current-users
>Arrival-Date:   Tue Aug 08 07:30:01 PDT 2000
>Closed-Date:
>Last-Modified:
>Originator:     Daniel Lang
>Release:        FreeBSD 4.0-STABLE i386
>Organization:
TU Muenchen
>Environment:

Hardware configuration:

Athlon 700 on ASUS K7V board, BIOS Revision 1005
256 MB ECC RAM (ECC mode disabled because of BIOS problems),
Adaptec 29160, 3Com 3c905B-TX NIC; for the remaining
configuration, see the dmesg output below:

Relevant dmesg output:

Copyright (c) 1992-2000 The FreeBSD Project.
Copyright (c) 1982, 1986, 1989, 1991, 1993
        The Regents of the University of California. All rights reserved.
FreeBSD 4.0-STABLE #0: Fri Jul  7 12:36:02 CEST 2000
    root@atleo3.leo.org:/usr/obj/usr/src/sys/ATLEO3
Timecounter "i8254"  frequency 1193182 Hz
CPU: AMD Athlon(tm) Processor (700.03-MHz 686-class CPU)
  Origin = "AuthenticAMD"  Id = 0x621  Stepping = 1
  Features=0x183f9ff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,MMX,FXSR>
  AMD Features=0xc0400000<AMIE,DSP,3DNow!>
real memory  = 268419072 (262128K bytes)
avail memory = 256929792 (250908K bytes)
Preloaded elf kernel "kernel" at 0xc03b6000.
Pentium Pro MTRR support enabled
md0: Malloc disk
npx0: <math processor> on motherboard
npx0: INT 16 interface
pcib0: <Host to PCI bridge> on motherboard
pci0: <PCI bus> on pcib0
pcib2: <VIA 82C598MVP (Apollo MVP3) PCI-PCI (AGP) bridge> at device 1.0 on pci0
pci1: <PCI bus> on pcib2
pci1: <ATI Mach64-GB graphics accelerator> at 0.0 irq 11
isab0: <VIA 82C686 PCI-ISA bridge> at device 4.0 on pci0
isa0: <ISA bus> on isab0
pci0: <VIA Apollo ATA controller> at 4.1
uhci0: <VIA 83C572 USB controller> port 0xb400-0xb41f irq 15 at device 4.2 on pci0
usb0: <VIA 83C572 USB controller> on uhci0
usb0: USB revision 1.0
uhub0: VIA UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub0: 2 ports with 2 removable, self powered
uhub0: port 1 power on failed, IOERROR
uhub0: port 2 power on failed, IOERROR
uhci1: <VIA 83C572 USB controller> port 0xb000-0xb01f irq 15 at device 4.3 on pci0
usb1: <VIA 83C572 USB controller> on uhci1
usb1: USB revision 1.0
uhub1: VIA UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub1: 2 ports with 2 removable, self powered
uhub1: port 1 power on failed, IOERROR
uhub1: port 2 power on failed, IOERROR
xl0: <3Com 3c905B-TX Fast Etherlink XL> port 0x9400-0x947f mem 0xe1000000-0xe100007f irq 12 at device 10.0 on pci0
xl0: Ethernet address: 00:a0:24:a6:86:04
miibus0: <MII bus> on xl0
xlphy0: <3Com internal media interface> on miibus0
xlphy0:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
xl0: supplying EUI64: 00:a0:24:ff:fe:a6:86:04
ahc0: <Adaptec 29160 Ultra160 SCSI adapter> port 0x9000-0x90ff mem 0xe0800000-0xe0800fff irq 10 at device 11.0 on pci0
ahc0: aic7892 Wide Channel A, SCSI Id=7, 16/255 SCBs
pcib1: <Host to PCI bridge> on motherboard
pci2: <PCI bus> on pcib1
fdc0: <NEC 72065B or clone> at port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on isa0
fdc0: FIFO enabled, 8 bytes threshold
fd0: <1440-KB 3.5" drive> on fdc0 drive 0
atkbdc0: <Keyboard controller (i8042)> at port 0x60,0x64 on isa0
atkbd0: <AT Keyboard> flags 0x1 irq 1 on atkbdc0
kbd0 at atkbd0
vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
sc0: <System console> at flags 0x100 on isa0
sc0: VGA <12 virtual consoles, flags=0x100>
sio0 at port 0x3f8-0x3ff irq 4 flags 0x10 on isa0
sio0: type 16550A, console
sio1 at port 0x2f8-0x2ff irq 3 on isa0
sio1: type 16550A
ppc0: <Parallel port> at port 0x378-0x37f irq 7 on isa0
ppc0: SMC-like chipset (ECP/EPP/PS2/NIBBLE) in COMPATIBLE mode
ppc0: FIFO with 16/16/8 bytes threshold
ppi0: <Parallel I/O> on ppbus0
lpt0: <Printer> on ppbus0
lpt0: Interrupt-driven port
plip0: <PLIP network interface> on ppbus0
IP packet filtering initialized, divert enabled, rule-based forwarding enabled, default to accept, logging limited to 100 packets/entry by default
IPsec: Initialized Security Association Processing.
IPv6 packet filtering initialized, default to accept, logging limited to 100 packets/entry
IP Filter: initialized.  Default = pass all, Logging = enabled
IP Filter: v3.3.8
Waiting 3 seconds for SCSI devices to settle
Mounting root from ufs:/dev/da0s1a
da0 at ahc0 bus 0 target 1 lun 0
da0: <IBM DNES-309170W SA30> Fixed Direct Access SCSI-3 device 
da0: 80.000MB/s transfers (40.000MHz, offset 31, 16bit), Tagged Queueing Enabled
da0: 8748MB (17916240 512 byte sectors: 255H 63S/T 1115C)
WARNING: / was not properly dismounted

Kernel config file:


machine         i386
#cpu            I386_CPU
#cpu            I486_CPU
#cpu            I586_CPU
cpu             I686_CPU
ident           ATLEO3
maxusers        256

options         INET                    #InterNETworking
options         INET6                   #IPv6 communications protocols
options         IPSEC                   #IP security
options         IPSEC_ESP               #IP security (crypto; define w/ IPSEC)
options         IPSEC_DEBUG             #debug for IP security
options         MROUTING
options         IPFIREWALL              #firewall
options         IPFIREWALL_VERBOSE      #print information about
                                        # dropped packets
options         IPFIREWALL_FORWARD      #enable transparent proxy support
options         IPFIREWALL_VERBOSE_LIMIT=100    #limit verbosity
#options         IPFIREWALL_DEFAULT_TO_ACCEPT    #allow everything by default
options         IPV6FIREWALL            #firewall for IPv6
options         IPV6FIREWALL_VERBOSE
options         IPV6FIREWALL_VERBOSE_LIMIT=100
#options         IPV6FIREWALL_DEFAULT_TO_ACCEPT
options         IPDIVERT                #divert sockets
options         IPFILTER                #ipfilter support
options         IPFILTER_LOG            #ipfilter logging
options         IPSTEALTH               #support for stealth forwarding
options         TCPDEBUG
#options         TCP_DROP_SYNFIN         #drop TCP packets with SYN+FIN
options         TCP_RESTRICT_RST        #restrict emission of TCP RST

options         FFS                     #Berkeley Fast Filesystem
options         FFS_ROOT                #FFS usable as root device [keep this!]
options         SOFTUPDATES             #Enable FFS soft updates support
options         MFS                     #Memory Filesystem
options         MD_ROOT                 #MD is a potential root device
options         NFS                     #Network Filesystem
options         NFS_ROOT                #NFS usable as root device, NFS required
#options        MSDOSFS                 #MSDOS Filesystem
#options        CD9660                  #ISO 9660 Filesystem
#options        CD9660_ROOT             #CD-ROM usable as root, CD9660 required
#options        PROCFS                  #Process filesystem

options         COMPAT_43               #Compatible with BSD 4.3 [KEEP THIS!]
options         SCSI_DELAY=3000 #Delay (in ms) before probing SCSI
options         UCONSOLE                #Allow users to grab the console
options         USERCONFIG              #boot -c editor
options         VISUAL_USERCONFIG       #visual boot -c editor
options         KTRACE                  #ktrace(1) support
options         SYSVSHM                 #SYSV-style shared memory
options         SYSVMSG                 #SYSV-style message queues
options         SYSVSEM                 #SYSV-style semaphores
options         P1003_1B                #Posix P1003_1B real-time extensions
options         _KPOSIX_PRIORITY_SCHEDULING
options         ICMP_BANDLIM            #Rate limit bad replies
options         KBD_INSTALL_CDEV        # install a CDEV entry in /dev
options         NETGRAPH

device          isa
device          eisa
device          pci

# Floppy drives
device          fdc0    at isa? port IO_FD1 irq 6 drq 2
device          fd0     at fdc0 drive 0
device          fd1     at fdc0 drive 1

# SCSI Controllers
device          ahc0            # AHA2940 and onboard AIC7xxx devices

# SCSI peripherals
device          scbus           # SCSI bus (required)
device          da              # Direct Access (disks)
device          sa              # Sequential Access (tape etc)
device          cd              # CD
device          pass            # Passthrough device (direct SCSI access)

# disks
device          scbus0 at ahc0
device          da0 at scbus0 target 0

# atkbdc0 controls both the keyboard and the PS/2 mouse
device          atkbdc0 at isa? port IO_KBD
device          atkbd0  at atkbdc? irq 1 flags 0x1
device          psm0    at atkbdc? irq 12

device          vga0    at isa?

# splash screen/screen saver
pseudo-device   splash

# syscons is the default console driver, resembling an SCO console
device          sc0     at isa? flags 0x100
options         MAXCONS=12              # number of virtual consoles
options         SC_NORM_ATTR="(FG_LIGHTGREY|BG_BLACK)"
options         SC_NORM_REV_ATTR="(FG_YELLOW|BG_GREEN)"
options         SC_KERNEL_CONS_ATTR="(FG_WHITE|BG_BLUE)"
options         SC_KERNEL_CONS_REV_ATTR="(FG_BLACK|BG_RED)"
# Floating point support - do not disable.
device          npx0    at nexus? port IO_NPX irq 13

# Power management support (see LINT for more options)
device          apm0    at nexus? disable flags 0x20 # Advanced Power Management

# Serial (COM) ports
device          sio0    at isa? port IO_COM1 flags 0x10 irq 4
device          sio1    at isa? port IO_COM2 irq 3
device          sio2    at isa? disable port IO_COM3 irq 5
device          sio3    at isa? disable port IO_COM4 irq 9

# Parallel port
device          ppc0    at isa? irq 7
device          ppbus           # Parallel port bus (required)
device          lpt             # Printer
device          plip            # TCP/IP over parallel
device          ppi             # Parallel port interface device
#device         vpo             # Requires scbus and da

# PCI Ethernet NICs.
device          de              # DEC/Intel DC21x4x (``Tulip'')
device          fxp             # Intel EtherExpress PRO/100B (82557, 82558)
device          tx              # SMC 9432TX (83c170 ``EPIC'')
device          vx              # 3Com 3c590, 3c595 (``Vortex'')
device          wx              # Intel Gigabit Ethernet Card (``Wiseman'')

# PCI Ethernet NICs that use the common MII bus controller code.
device          miibus          # MII bus support
device          dc              # DEC/Intel 21143 and various workalikes
device          rl              # RealTek 8129/8139
device          sf              # Adaptec AIC-6915 (``Starfire'')
device          sis             # Silicon Integrated Systems SiS 900/SiS 7016
device          ste             # Sundance ST201 (D-Link DFE-550TX)
device          tl              # Texas Instruments ThunderLAN
device          vr              # VIA Rhine, Rhine II
device          wb              # Winbond W89C840F
device          xl              # 3Com 3c90x (``Boomerang'', ``Cyclone'')

# Pseudo devices - the number indicates how many units to allocate.
pseudo-device   loop            # Network loopback
pseudo-device   ether           # Ethernet support
pseudo-device   sl      1       # Kernel SLIP
pseudo-device   ppp     1       # Kernel PPP
pseudo-device   tun             # Packet tunnel.
pseudo-device   pty     256     # Pseudo-ttys (telnet etc)
pseudo-device   md              # Memory "disks"
pseudo-device   gif     4       # IPv6 and IPv4 tunneling
pseudo-device   faith   1       # IPv6-to-IPv4 relaying (translation)
pseudo-device   vn
pseudo-device   snp     4

# The `bpf' pseudo-device enables the Berkeley Packet Filter.
# Be aware of the administrative consequences of enabling this!
pseudo-device   bpf             #Berkeley packet filter

# USB support
device          uhci            # UHCI PCI->USB interface
device          ohci            # OHCI PCI->USB interface
device          usb             # USB Bus (required)
device          ugen            # Generic
device          uhid            # "Human Interface Devices"
device          ukbd            # Keyboard
device          ulpt            # Printer
device          umass           # Disks/Mass storage - Requires scbus and da
device          ums             # Mouse
# USB Ethernet, requires mii
device          aue             # ADMtek USB ethernet
device          cue             # CATC USB ethernet
device          kue             # Kawasaki LSI USB ethernet

Note on the ECC RAM:

Although the DIMMs are ECC-capable (9 chips), the system/BIOS
locked up completely when the memory mode was set to ECC and
could only be recovered by clearing the CMOS via the solder pins,
so in this configuration ECC was not activated in the BIOS.

I checked the DIMMs with the boot-disk-based memtest-x86 2.3,
which runs a wide variety of extensive pattern tests on
the memory (with and without cache) and showed no errors,
so I suspect this is not a hardware problem.

>Description:

The machine has crashed 4 times with this configuration,
always for the same reason (see below), but never
following any reproducible action. At one point the machine
was up for more than 20 days, but then it crashed
twice within a few days.

The machine runs as an IRCnet server in production and
is therefore a prime target for DoS attacks, but no
particular attack was registered (on routers, etc.)
at the times it crashed.

Here are the last two kgdb traces:

1.

(no debugging symbols found)...
IdlePTD 3964928
initial pcb at 3266a0
panicstr: page fault
panic messages:
---
Fatal trap 12: page fault while in kernel mode
fault virtual address   = 0x20
fault code              = supervisor read, page not present
instruction pointer     = 0x8:0xc01bba4d
stack pointer           = 0x10:0xc02fb320
frame pointer           = 0x10:0xc02fb32c
code segment            = base 0x0, limit 0xfffff, type 0x1b
                        = DPL 0, pres 1, def32 1, gran 1
processor eflags        = interrupt enabled, resume, IOPL = 0
current process         = Idle
interrupt mask          = 
trap number             = 12
panic: page fault

syncing disks... 

Fatal trap 12: page fault while in kernel mode
fault virtual address   = 0x30
fault code              = supervisor read, page not present
instruction pointer     = 0x8:0xc0253680
stack pointer           = 0x10:0xc02fb158
frame pointer           = 0x10:0xc02fb15c
code segment            = base 0x0, limit 0xfffff, type 0x1b
                        = DPL 0, pres 1, def32 1, gran 1
processor eflags        = interrupt enabled, resume, IOPL = 0
current process         = Idle
interrupt mask          = bio 
trap number             = 12
panic: page fault
Uptime: 23d2h32m25s
dumping to dev #da/0x20001, offset 524320
dump 255 254 253 252 251 250 249 248 247 246 245 244 243 242 241 240 239 238 237 236 235 234 233 232 231 230 229 228 227 226 225 224 223 222 221 220 219 218 217 216 215 214 213 212 211 210 209 208 207 206 205 204 203 202 201 200 199 198 197 196 195 194 193 192 191 190 189 188 187 186 185 184 183 182 181 180 179 178 177 176 175 174 173 172 171 170 169 168 167 166 165 164 163 162 161 160 159 158 157 156 155 154 153 152 151 150 149 148 147 146 145 144 143 142 141 140 139 138 137 136 135 134 133 132 131 130 129 128 127 126 125 124 123 122 121 120 119 118 117 116 115 114 113 112 111 110 109 108 107 106 105 104 103 102 101 100 99 98 97 96 95 94 93 92 91 90 89 88 87 86 85 84 83 82 81 80 79 78 77 76 75 74 73 72 71 70 69 68 67 66 65 64 63 62 61 60 59 58 57 56 55 54 53 52 51 50 49 48 47 46 45 44 43 42 41 40 39 38 37 36 35 34 33 32 31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
---
#0  0xc0161034 in boot ()
(kgdb) bt
#0  0xc0161034 in boot ()
#1  0xc01613b8 in poweroff_wait ()
#2  0xc02a77ad in trap_fatal ()
#3  0xc02a7485 in trap_pfault ()
#4  0xc02a7083 in trap ()
#5  0xc0253680 in acquire_lock ()
#6  0xc025735c in softdep_update_inodeblock ()
#7  0xc025296d in ffs_update ()
#8  0xc025a534 in ffs_sync ()
#9  0xc018d517 in sync ()
#10 0xc0160e07 in boot ()
#11 0xc01613b8 in poweroff_wait ()
#12 0xc02a77ad in trap_fatal ()
#13 0xc02a7485 in trap_pfault ()
#14 0xc02a7083 in trap ()
#15 0xc01bba4d in tcp_timer_persist ()
#16 0xc01667b1 in softclock ()

2.

(no debugging symbols found)...
IdlePTD 3964928
initial pcb at 3266a0
panicstr: page fault
panic messages:
---
Fatal trap 12: page fault while in kernel mode
fault virtual address   = 0x20
fault code              = supervisor read, page not present
instruction pointer     = 0x8:0xc01bba4d
stack pointer           = 0x10:0xc02fb320
frame pointer           = 0x10:0xc02fb32c
code segment            = base 0x0, limit 0xfffff, type 0x1b
                        = DPL 0, pres 1, def32 1, gran 1
processor eflags        = interrupt enabled, resume, IOPL = 0
current process         = Idle
interrupt mask          = 
trap number             = 12
panic: page fault

syncing disks... 

Fatal trap 12: page fault while in kernel mode
fault virtual address   = 0x30
fault code              = supervisor read, page not present
instruction pointer     = 0x8:0xc0253680
stack pointer           = 0x10:0xc02fb158
frame pointer           = 0x10:0xc02fb15c
code segment            = base 0x0, limit 0xfffff, type 0x1b
                        = DPL 0, pres 1, def32 1, gran 1
processor eflags        = interrupt enabled, resume, IOPL = 0
current process         = Idle
interrupt mask          = bio 
trap number             = 12
panic: page fault
Uptime: 3d17h36m34s

dumping to dev #da/0x20001, offset 524320
dump 255 254 253 252 251 250 249 248 247 246 245 244 243 242 241 240 239 238 237 236 235 234 233 232 231 230 229 228 227 226 225 224 223 222 221 220 219 218 217 216 215 214 213 212 211 210 209 208 207 206 205 204 203 202 201 200 199 198 197 196 195 194 193 192 191 190 189 188 187 186 185 184 183 182 181 180 179 178 177 176 175 174 173 172 171 170 169 168 167 166 165 164 163 162 161 160 159 158 157 156 155 154 153 152 151 150 149 148 147 146 145 144 143 142 141 140 139 138 137 136 135 134 133 132 131 130 129 128 127 126 125 124 123 122 121 120 119 118 117 116 115 114 113 112 111 110 109 108 107 106 105 104 103 102 101 100 99 98 97 96 95 94 93 92 91 90 89 88 87 86 85 84 83 82 81 80 79 78 77 76 75 74 73 72 71 70 69 68 67 66 65 64 63 62 61 60 59 58 57 56 55 54 53 52 51 50 49 48 47 46 45 44 43 42 41 40 39 38 37 36 35 34 33 32 31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
---
#0  0xc0161034 in boot ()
(kgdb) bt
#0  0xc0161034 in boot ()
#1  0xc01613b8 in poweroff_wait ()
#2  0xc02a77ad in trap_fatal ()
#3  0xc02a7485 in trap_pfault ()
#4  0xc02a7083 in trap ()
#5  0xc0253680 in acquire_lock ()
#6  0xc025735c in softdep_update_inodeblock ()
#7  0xc025296d in ffs_update ()
#8  0xc025a534 in ffs_sync ()
#9  0xc018d517 in sync ()
#10 0xc0160e07 in boot ()
#11 0xc01613b8 in poweroff_wait ()
#12 0xc02a77ad in trap_fatal ()
#13 0xc02a7485 in trap_pfault ()
#14 0xc02a7083 in trap ()
#15 0xc01bba4d in tcp_timer_persist ()
#16 0xc01667b1 in softclock ()
(kgdb) quit


It is not a mistake that the two reports are identical.
All pointers and addresses appear to be the same. For that reason
I guess hardware failure can be ruled out, but I have no experience
in analysing crash dumps, so I can't really tell.

I'm also wondering whether, and why, _two_ panics seem to
occur. Or is this just regular behaviour during a crash dump?

As this is a production machine that should have long
uptimes, I have not built a debugging kernel yet, but I
will do so to get more information from the next crash (if it happens).

>How-To-Repeat:

I can't tell. The crashes just happened, with no particular
triggering action that could be identified.

>Fix:

No idea, but:

I've installed a BIOS upgrade to revision 1007, which seems to have
solved the problem with ECC RAM, and updated to 4.1-STABLE.
The box is now running with ECC enabled on 4.1-STABLE.
I cannot yet tell whether it will remain stable from now on, but
I will keep you up to date.


>Release-Note:
>Audit-Trail:
>Unformatted:


To Unsubscribe: send mail to majordomo@FreeBSD.org
with "unsubscribe freebsd-bugs" in the body of the message



