Date: Fri, 5 May 2000 11:28:55 -0700 (PDT) From: Matthew Dillon <dillon@apollo.backplane.com> To: Howard Leadmon <howardl@account.abs.net> Cc: Bill Paul <wpaul@skynet.ctr.columbia.edu>, Greg Lehey <grog@lemis.com>, freebsd-stable@FreeBSD.ORG, freebsd-hackers@FreeBSD.ORG Subject: Re: Debugging Kernel/System Crashes, can anyone help?? Message-ID: <200005051828.LAA80765@apollo.backplane.com> References: <200005051633.MAA13634@account.abs.net>
next in thread | previous in thread | raw e-mail | index | archive | help
: Hello Matt, : : Well I almost thought we had something here, as the machine actually stayed :online for over a day this past time. Anyway I did add your patch, and when :the machine died I now have a crashdump of the data with your patch included, :when I will provide below. FYI, I have kept the past crashinfo as well as :the current one, so if you need any other info just let me know. Hopefully :something in this latest crash will stick out to you.. :... This looks to be a different bug. I think it may be in Bill Paul's department. It looks like either 'sc' is NULL, or 'sc->dc_btag' or 'sc->dc_bhandle' is NULL in dc_intr. This could be SMP/interrupt related. It's hard to tell exactly which line is crunching due to the inline I/O instruction in machien/cpufunc.h. It would help if you could 'nm kernel.debug | fgrep dc_intr' to locate the start adderss of the dc_intr procedure. It would also help if you could do the following from the gdb of the kernel: frame 14 print sc print *sc -Matt Matthew Dillon <dillon@backplane.com> :Fatal trap 12: page fault while in kernel mode :mp_lock = 01000003; cpuid = 1; lapic.id = 01000000 :fault virtual address = 0x30 :fault code = supervisor read, page not present :instruction pointer = 0x8:0xc01cdca5 :stack pointer = 0x10:0xff80ddd0 :frame pointer = 0x10:0xff80ddd4 :code segment = base 0x0, limit 0xfffff, type 0x1b : = DPL 0, pres 1, def32 1, gran 1 :processor eflags = interrupt enabled, resume, IOPL = 0 :current process = Idle :interrupt mask = net bio cam <- SMP: XXX :trap number = 12 :panic: page fault :mp_lock = 01000003; cpuid = 1; lapic.id = 01000000 :boot() called on cpu#1 :Uptime: 1h52m40s : :dumping to dev #ad/0x20001, offset 128 :--- :#12 0xc02283d6 in trap_fatal (frame=0xff80df44, eva=0) : at ../../i386/i386/trap.c:926 :#13 0xc0227d92 in trap (frame={tf_fs = -1072365544, tf_es = 1174929424, : tf_ds = -1059192816, tf_edi = -194641856, tf_esi = -1059168256, : tf_ebp = -8331372, tf_isp = -8331408, tf_ebx = -1059161408, : tf_edx = 54312, tf_ecx = 0, tf_eax = -194641856, tf_trapno = 29, : tf_err = 0, tf_eip = -1071900421, tf_cs = 8, tf_eflags = 582, : tf_esp = -1059161408, tf_ss = 1074266112}) at ../../i386/i386/trap.c:586 :#14 0xc01c18fb in dc_intr (arg=0xc0de6000) at machine/cpufunc.h:331 :#15 0xc0230953 in intr_mux (arg=0xc0de7b20) : at ../../i386/isa/intr_machdep.c:569 :(kgdb) q :# :--- :Howard Leadmon - howardl@abs.net - http://www.abs.net :ABSnet Internet Services - Phone: 410-361-8160 - FAX: 410-361-8162 : To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-stable" in the body of the message
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?200005051828.LAA80765>