Date: Mon, 21 Apr 2008 09:46:40 +0900
From: Pyun YongHyeon <pyunyh@gmail.com>
To: Mike Tancsa <mike@sentex.net>
Cc: freebsd-stable@freebsd.org
Subject: Re: AMD64 crash on RELENG_7
Message-ID: <20080421004640.GA44371@cdnetworks.co.kr>
In-Reply-To: <200804191540.m3JFeTbI076484@lava.sentex.ca>
References: <200804191540.m3JFeTbI076484@lava.sentex.ca>

On Sat, Apr 19, 2008 at 11:40:33AM -0400, Mike Tancsa wrote:
>
> Not sure if this is a hardware issue or a bug, but it was the first
> crash since putting the box into production a few weeks ago
>
>
>
> 0[verify1]# kgdb kernel.debug vmcore.0
> [GDB will not be able to debug user-mode threads:
> /usr/lib/libthread_db.so: Undefined symbol "ps_pglobal_lookup"]
> GNU gdb 6.1.1 [FreeBSD]
> Copyright 2004 Free Software Foundation, Inc.
> GDB is free software, covered by the GNU General Public License, and you are
> welcome to change it and/or distribute copies of it under certain
> conditions.
> Type "show copying" to see the conditions.
> There is absolutely no warranty for GDB. Type "show warranty" for details.
> This GDB was configured as "amd64-marcel-freebsd".
> There is no member named pathname.
> (kgdb) bt
> #0 doadump () at pcpu.h:194
> #1 0x0000000000000004 in ?? ()
> #2 0xffffffff802b0f29 in boot (howto=260) at
> /usr/src/sys/kern/kern_shutdown.c:409
> #3 0xffffffff802b132d in panic (fmt=0x104 <Address 0x104 out of
> bounds>) at /usr/src/sys/kern/kern_shutdown.c:563
> #4 0xffffffff8046a9b3 in vm_fault (map=0xffffff0001000000,
> vaddr=18446744072332865536, fault_type=1 '\001', fault_flags=0)
> at /usr/src/sys/vm/vm_fault.c:275
> #5 0xffffffff8049facf in trap_pfault (frame=0xffffffffabed4a30,
> usermode=0) at /usr/src/sys/amd64/amd64/trap.c:630
> #6 0xffffffff804a03f8 in trap (frame=0xffffffffabed4a30) at
> /usr/src/sys/amd64/amd64/trap.c:410
> #7 0xffffffff8048606e in calltrap () at
> /usr/src/sys/amd64/amd64/exception.S:169
> #8 0xffffffff8049ebb6 in bcopy () at /usr/src/sys/amd64/amd64/support.S:123
> #9 0xffffffff802fefb4 in m_devget (buf=0xffffffffadf16b6a "",
> totlen=18231, off=0, ifp=0xffffff0001041800, copy=0)
  ^^^^^^^^^^^^
I think this should be fixed in if_rl.c rev 1.170.2.3. m_devget(9)
should never be handed a length this wrong: 18231 bytes is far larger
than any frame rl(4) can handle. Are you using the latest if_rl.c on
RELENG_7?

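The numbers in the trace look consistent with that reading. Below is a minimal sketch in Python that only redoes the arithmetic on the values from frames #4 and #9. The 4 KB page size and the 1518-byte maximum untagged Ethernet frame are assumptions made for illustration, not values taken from the driver source.

```python
# Values copied from the kgdb backtrace above.
buf = 0xffffffffadf16b6a             # m_devget() buf argument, frame #9
totlen = 18231                       # m_devget() totlen argument, frame #9
fault_vaddr = 18446744072332865536   # vm_fault() vaddr argument, frame #4

# Assumptions, not taken from if_rl.c:
PAGE_SIZE = 4096          # 4 KB pages
ETHER_MAX_FRAME = 1518    # largest untagged Ethernet frame, CRC included


def next_page(addr, page_size=PAGE_SIZE):
    """Return the start of the page following the one that contains addr."""
    return (addr // page_size + 1) * page_size


page_end = next_page(buf)
bytes_left_in_page = page_end - buf
copy_end = buf + totlen

print(hex(fault_vaddr))                # 0xffffffffadf17000
print(hex(page_end))                   # 0xffffffffadf17000
print(bytes_left_in_page)              # 1174
print(totlen > ETHER_MAX_FRAME)        # True
print(buf <= fault_vaddr < copy_end)   # True
```

The faulting address is the first byte of the page right after the one holding buf, and it falls inside the range that a copy of totlen bytes would cover. That fits a bcopy() that ran off the end of the mapped receive buffer because of a bogus length, though by itself it does not rule out a hardware problem.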
> at /usr/src/sys/kern/uipc_mbuf.c:1233
> #10 0xffffffff8041f2cc in rl_rxeof (sc=0xffffff0002e2c000) at
> /usr/src/sys/pci/if_rl.c:1179
> #11 0xffffffff8041f878 in rl_intr (arg=Variable "arg" is not available.
> ) at /usr/src/sys/pci/if_rl.c:1362
> #12 0xffffffff80295530 in ithread_loop (arg=0xffffff00011fd160) at
> /usr/src/sys/kern/kern_intr.c:1036
> #13 0xffffffff802925e3 in fork_exit (callout=0xffffffff802953c0
> <ithread_loop>, arg=0xffffff00011fd160,
> frame=0xffffffffabed4c80) at /usr/src/sys/kern/kern_fork.c:783
> #14 0xffffffff8048643e in fork_trampoline () at
> /usr/src/sys/amd64/amd64/exception.S:415
> #15 0x0000000000000000 in ?? ()
> #16 0x0000000000000000 in ?? ()
> #17 0x0000000000000001 in ?? ()
> #18 0x0000000000000000 in ?? ()
> #19 0x0000000000000000 in ?? ()
> #20 0x0000000000000000 in ?? ()
> #21 0x0000000000000000 in ?? ()
> #22 0x0000000000000000 in ?? ()
> #23 0x0000000000000000 in ?? ()
> #24 0x0000000000000000 in ?? ()
> #25 0x0000000000000000 in ?? ()
> #26 0x0000000000000000 in ?? ()
> #27 0x0000000000000000 in ?? ()
> #28 0x0000000000000000 in ?? ()
> #29 0x0000000000000000 in ?? ()
> #30 0x0000000000000000 in ?? ()
> #31 0x0000000000000000 in ?? ()
> #32 0x0000000000000000 in ?? ()
> #33 0x0000000000000000 in ?? ()
> #34 0x0000000000000000 in ?? ()
> #35 0x0000000000000000 in ?? ()
> #36 0x0000000000000000 in ?? ()
> ---Type <return> to continue, or q <return> to quit---
> #37 0x0000000000000000 in ?? ()
> #38 0x0000000000000000 in ?? ()
> #39 0x0000000000893000 in ?? ()
> #40 0xffffffff8069a200 in tdg_maxid ()
> #41 0xffffffff806a6a00 in tdq_cpu ()
> #42 0xffffffff806a6a00 in tdq_cpu ()
> #43 0xffffff000118c350 in ?? ()
> #44 0xffffff000118c658 in ?? ()
> #45 0xffffffffabed4b38 in ?? ()
> #46 0xffffff000118c350 in ?? ()
> #47 0xffffffff802cf4e8 in sched_switch (td=0xffffffff802953c0,
> newtd=0x0, flags=Variable "flags" is not available.
> ) at /usr/src/sys/kern/sched_ule.c:1898
> #48 0x0000000000000000 in ?? ()
> #49 0x0000000000000000 in ?? ()
> #50 0x0000000000000000 in ?? ()
> #51 0x0000000000000000 in ?? ()
> #52 0x0000000000000000 in ?? ()
> #53 0x0000000000000000 in ?? ()
> #54 0x0000000000000000 in ?? ()
> #55 0x0000000000000000 in ?? ()
> #56 0x0000000000000000 in ?? ()
> #57 0x0000000000000000 in ?? ()
> #58 0x0000000000000000 in ?? ()
> #59 0x0000000000000000 in ?? ()
> #60 0x0000000000000000 in ?? ()
> #61 0x0000000000000000 in ?? ()
> #62 0x0000000000000000 in ?? ()
> #63 0x0000000000000000 in ?? ()
> #64 0x0000000000000000 in ?? ()
> #65 0x0000000000000000 in ?? ()
> #66 0x0000000000000000 in ?? ()
> #67 0x0000000000000000 in ?? ()
> #68 0x0000000000000000 in ?? ()
> #69 0x0000000000000000 in ?? ()
> #70 0x0000000000000000 in ?? ()
> #71 0x0000000000000000 in ?? ()
> #72 0x0000000000000000 in ?? ()
> #73 0x0000000000000000 in ?? ()
> #74 0x0000000000000000 in ?? ()
> #75 0x0000000000000000 in ?? ()
> #76 0x0000000000000000 in ?? ()
> ---Type <return> to continue, or q <return> to quit---
> #77 0x0000000000000000 in ?? ()
> #78 0x0000000000000000 in ?? ()
> #79 0x0000000000000000 in ?? ()
> #80 0x0000000000000000 in ?? ()
> #81 0x0000000000000000 in ?? ()
> #82 0x0000000000000000 in ?? ()
> #83 0x0000000000000000 in ?? ()
> #84 0x0000000000000000 in ?? ()
> #85 0x0000000000000000 in ?? ()
> #86 0x0000000000000000 in ?? ()
> #87 0x0000000000000000 in ?? ()
> #88 0x0000000000000000 in ?? ()
> #89 0x0000000000000000 in ?? ()
> #90 0x0000000000000000 in ?? ()
> #91 0x0000000000000000 in ?? ()
> #92 0x0000000000000000 in ?? ()
> #93 0x0000000000000000 in ?? ()
> #94 0x0000000000000000 in ?? ()
> #95 0x0000000000000000 in ?? ()
> #96 0x0000000000000000 in ?? ()
> #97 0x0000000000000000 in ?? ()
> #98 0x0000000000000000 in ?? ()
> #99 0x0000000000000000 in ?? ()
> #100 0x0000000000000000 in ?? ()
> #101 0x0000000000000000 in ?? ()
> #102 0x0000000000000000 in ?? ()
> #103 0x0000000000000000 in ?? ()
> #104 0x0000000000000000 in ?? ()
> #105 0x0000000000000000 in ?? ()
> #106 0x0000000000000000 in ?? ()
> #107 0x0000000000000000 in ?? ()
> #108 0x0000000000000000 in ?? ()
> #109 0x0000000000000000 in ?? ()
> #110 0x0000000000000000 in ?? ()
> #111 0x0000000000000000 in ?? ()
> #112 0x0000000000000000 in ?? ()
> #113 0x0000000000000000 in ?? ()
> #114 0x0000000000000000 in ?? ()
> #115 0x0000000000000000 in ?? ()
> Cannot access memory at address 0xffffffffabed5000
>
> Stable from Mar 26th
>
> cat /var/run/dmesg.boot
> panic: vm_fault: fault on nofault entry, addr: ffffffffadf17000
> cpuid = 0
> Uptime: 22d23h55m31s
> Physical memory: 4054 MB
> Dumping 519 MB: 504 488 472 456 440 424 408 392 376 360 344 328 312
> 296 280 264 248 232 216 200 184 168 152 136 120 104 88 72 56 40 24
> 8Copyright (c) 1992-2008 The FreeBSD Project.
> Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
> The Regents of the University of California. All rights reserved.
> FreeBSD is a registered trademark of The FreeBSD Foundation.
> FreeBSD 7.0-STABLE #1: Wed Mar 26 17:43:52 EDT 2008
> cgreen@verify1-new.sentex.ca:/usr/obj/usr/src/sys/Verify1-64
> Timecounter "i8254" frequency 1193182 Hz quality 0
> CPU: AMD Athlon(tm) 64 Processor 3800+ (2399.75-MHz K8-class CPU)
> Origin = "AuthenticAMD" Id = 0x40ff2 Stepping = 2
> Features=0x78bfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,MMX,FXSR,SSE,SSE2>
> Features2=0x2001<SSE3,CX16>
> AMD Features=0xea500800<SYSCALL,NX,MMX+,FFXSR,RDTSCP,LM,3DNow!+,3DNow!>
> AMD Features2=0x1d<LAHF,SVM,ExtAPIC,CR8>
> usable memory = 4251840512 (4054 MB)
> avail memory = 4082696192 (3893 MB)
> ACPI APIC Table: <RS485 AWRDACPI>
> ioapic0 <Version 2.1> irqs 0-23 on motherboard
> kbd1 at kbdmux0
> acpi0: <RS485 AWRDACPI> on motherboard
> acpi0: Overriding SCI Interrupt from IRQ 9 to IRQ 21
> acpi0: [ITHREAD]
> acpi0: Power Button (fixed)
> acpi0: reservation of 0, a0000 (3) failed
> acpi0: reservation of 100000, bdde0000 (3) failed
> Timecounter "ACPI-fast" frequency 3579545 Hz quality 1000
> acpi_timer0: <32-bit timer at 3.579545MHz> port 0x4008-0x400b on acpi0
> cpu0: <ACPI CPU> on acpi0
> powernow0: <PowerNow! K8> on cpu0
> acpi_button0: <Power Button> on acpi0
> pcib0: <ACPI Host-PCI bridge> port 0xcf8-0xcff on acpi0
> pci0: <ACPI PCI bus> on pcib0
> pcib1: <ACPI PCI-PCI bridge> at device 1.0 on pci0
> pci1: <ACPI PCI bus> on pcib1
> vgapci0: <VGA-compatible display> port 0xee00-0xeeff mem
> 0xf8000000-0xfbffffff,0xfdef0000-0xfdefffff irq 17 at device 5.0 on pci1
> atapci0: <ATI IXP400 SATA150 controller> port
> 0xfe00-0xfe07,0xfd00-0xfd03,0xfc00-0xfc07,0xfb00-0xfb03,0xfa00-0xfa0f
> mem 0xfe02f000-0xfe02f1ff irq 23 at device 17.0 on pci0
> atapci0: [ITHREAD]
> ata2: <ATA channel 0> on atapci0
> ata2: [ITHREAD]
> ata3: <ATA channel 1> on atapci0
> ata3: [ITHREAD]
> atapci1: <ATI IXP400 SATA150 controller> port
> 0xf900-0xf907,0xf800-0xf803,0xf700-0xf707,0xf600-0xf603,0xf500-0xf50f
> mem 0xfe02e000-0xfe02e1ff irq 22 at device 18.0 on pci0
> atapci1: [ITHREAD]
> ata4: <ATA channel 0> on atapci1
> ata4: [ITHREAD]
> ata5: <ATA channel 1> on atapci1
> ata5: [ITHREAD]
> ohci0: <ATI SB400 USB Controller> mem 0xfe02d000-0xfe02dfff at device
> 19.0 on pci0
> ohci0: [GIANT-LOCKED]
> ohci0: [ITHREAD]
> usb0: OHCI version 1.0, legacy support
> usb0: SMM does not respond, resetting
> usb0: <ATI SB400 USB Controller> on ohci0
> usb0: USB revision 1.0
> uhub0: <ATI OHCI root hub, class 9/0, rev 1.00/1.00, addr 1> on usb0
> uhub0: 4 ports with 4 removable, self powered
> ohci1: <ATI SB400 USB Controller> mem 0xfe02c000-0xfe02cfff at device
> 19.1 on pci0
> ohci1: [GIANT-LOCKED]
> ohci1: [ITHREAD]
> usb1: OHCI version 1.0, legacy support
> usb1: SMM does not respond, resetting
> usb1: <ATI SB400 USB Controller> on ohci1
> usb1: USB revision 1.0
> uhub1: <ATI OHCI root hub, class 9/0, rev 1.00/1.00, addr 1> on usb1
> uhub1: 4 ports with 4 removable, self powered
> ehci0: <ATI SB400 USB 2.0 controller> mem 0xfe02b000-0xfe02bfff at
> device 19.2 on pci0
> ehci0: [GIANT-LOCKED]
> ehci0: [ITHREAD]
> usb2: waiting for BIOS to give up control
> usb2: timed out waiting for BIOS
> usb2: EHCI version 1.0
> usb2: companion controllers, 4 ports each: usb0 usb1
> usb2: <ATI SB400 USB 2.0 controller> on ehci0
> usb2: USB revision 2.0
> uhub2: <ATI EHCI root hub, class 9/0, rev 2.00/1.00, addr 1> on usb2
> uhub2: 8 ports with 8 removable, self powered
> pci0: <serial bus, SMBus> at device 20.0 (no driver attached)
> atapci2: <ATI IXP400 UDMA133 controller> port
> 0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0xf300-0xf30f at device 20.1 on pci0
> ata0: <ATA channel 0> on atapci2
> ata0: [ITHREAD]
> ata1: <ATA channel 1> on atapci2
> ata1: [ITHREAD]
> isab0: <PCI-ISA bridge> at device 20.3 on pci0
> isa0: <ISA bus> on isab0
> pcib2: <ACPI PCI-PCI bridge> at device 20.4 on pci0
> pci2: <ACPI PCI bus> on pcib2
> rl0: <RealTek 8139 10/100BaseTX> port 0xde00-0xdeff mem
> 0xfddff000-0xfddff0ff irq 22 at device 5.0 on pci2
> miibus0: <MII bus> on rl0
> rlphy0: <RealTek internal media interface> PHY 0 on miibus0
> rlphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
> rl0: Ethernet address: 00:19:21:20:48:b8
> rl0: [ITHREAD]
> acpi_tz0: <Thermal Zone> on acpi0
> sio0: configured irq 4 not in bitmap of probed irqs 0
> sio0: port may not be enabled
> sio0: configured irq 4 not in bitmap of probed irqs 0
> sio0: port may not be enabled
> sio0: <16550A-compatible COM port> port 0x3f8-0x3ff irq 4 flags 0x10 on
> acpi0
> sio0: type 16550A
> sio0: [FILTER]
> orm0: <ISA Option ROM> at iomem 0xc0000-0xcefff on isa0
> atkbdc0: <Keyboard controller (i8042)> at port 0x60,0x64 on isa0
> atkbd0: <AT Keyboard> irq 1 on atkbdc0
> kbd0 at atkbd0
> atkbd0: [GIANT-LOCKED]
> atkbd0: [ITHREAD]
> sc0: <System console> at flags 0x100 on isa0
> sc0: VGA <16 virtual consoles, flags=0x300>
> sio1: configured irq 3 not in bitmap of probed irqs 0
> sio1: port may not be enabled
> vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
> ugen0: <American Power Conversion Back-UPS RS 1500 FW:8.g8 .D USB
> FW:g8, class 0/0, rev 1.10/1.06, addr 2> on uhub1
> Timecounter "TSC" frequency 2399746333 Hz quality 800
> Timecounters tick every 1.000 msec
> ad10: 76319MB <Seagate ST380815AS 3.AAD> at ata5-master SATA150
> Trying to mount root from ufs:/dev/ad10s1a
> WARNING: / was not properly dismounted
> WARNING: /tmp was not properly dismounted
> WARNING: /usr was not properly dismounted
> WARNING: /var was not properly dismounted
> /var: mount pending error: blocks 292 files 10
> rl0: link state changed to UP
>

-- 
Regards,
Pyun YongHyeon