From owner-freebsd-stable Mon Jan 14 5:23:31 2002 Delivered-To: freebsd-stable@freebsd.org Received: from mail.gactr.uga.edu (mail.gactr.uga.edu [128.192.37.21]) by hub.freebsd.org (Postfix) with SMTP id 156F237B402 for ; Mon, 14 Jan 2002 05:23:25 -0800 (PST) Received: (qmail 25392 invoked from network); 14 Jan 2002 13:23:19 -0000 Received: from smtpgw.servers.nat (HELO smtpgw.gactr.uga.edu) ([10.10.10.25]) (envelope-sender ) by 0 (qmail-ldap-1.03) with SMTP for ; 14 Jan 2002 13:23:19 -0000 Received: (qmail 1587 invoked from network); 14 Jan 2002 13:23:19 -0000 Received: from unknown (HELO gactr.uga.edu) (10.10.100.125) by smtpgw.servers.nat with SMTP; 14 Jan 2002 13:23:18 -0000 Message-ID: <3C42DBC6.532E932@gactr.uga.edu> Date: Mon, 14 Jan 2002 08:23:18 -0500 From: "Robin P. Blanchard" Organization: Georgia Center Information Technology X-Mailer: Mozilla 4.79 [en] (X11; U; Linux 2.4.2 i386) X-Accept-Language: en MIME-Version: 1.0 To: dillon@apollo.backplane.com, stable@freebsd.org Subject: dell 4350 SMP faults Content-Type: text/plain; charset=iso-8859-1 Content-Transfer-Encoding: 8bit Sender: owner-freebsd-stable@FreeBSD.ORG Precedence: bulk List-ID: List-Archive: (Web Archive) List-Help: (List Instructions) List-Subscribe: List-Unsubscribe: X-Loop: FreeBSD.ORG I'm sure Matt's pretty busy right now with the release of 4.5 so I'm sending this to the list to see if anyone else may have any ideas. Background: Dell 4350, dual p3-600 -- before FreeBSD was an NT 4.0 app/dbase server (also as SMP). First installed 4.4 STABLE on it back in october or november. As a uniprocessor box it runs without a hitch (relatively no load, no swapping, etc). As an SMP box, it *randomly* hangs. Matt had me remove gigE support and other superfluous kernel options all to no avail. Last guess was RAM. I tested all three sticks individually yesterday. I suppose it's possible that all three are bad but that probability is most likely extremely low. The box is based on 4.5-RC from Sunday morning. Here's the gdb output from all three hangs. Maybe it jogs someone's brain. Thanks in advance. 3rd dump, 3rst stick o'RAM: #0 dumpsys () at ../../kern/kern_shutdown.c:473 #1 0xc016cbd4 in boot (howto=260) at ../../kern/kern_shutdown.c:313 #2 0xc016d005 in panic (fmt=0xc0264f84 "from debugger") at ../../kern/kern_shutdown.c:581 #3 0xc0135f8d in db_panic (addr=-1072298682, have_addr=0, count=-1, modif=0xff80fdcc "") at ../../ddb/db_command.c:435 #4 0xc0135f2b in db_command (last_cmdp=0xc0299744, cmd_table=0xc0299584, aux_cmd_tablep=0xc02b5c58) at ../../ddb/db_command.c:333 #5 0xc0135ff2 in db_command_loop () at ../../ddb/db_command.c:457 #6 0xc01381a3 in db_trap (type=12, code=0) at ../../ddb/db_trap.c:71 #7 0xc023a134 in kdb_trap (type=12, code=0, regs=0xff80ff28) at ../../i386/i386/db_interface.c:158 #8 0xc024e59e in trap_fatal (frame=0xff80ff28, eva=84) at ../../i386/i386/trap.c:951 #9 0xc024e21d in trap_pfault (frame=0xff80ff28, usermode=0, eva=84) at ../../i386/i386/trap.c:849 #10 0xc024dd63 in trap (frame={tf_fs = -1072168936, tf_es = -1072168944, tf_ds = 1644167184, tf_edi = 0, tf_esi = 0, tf_ebp = -8323212, tf_isp = -8323244, tf_ebx = 0, tf_edx = 1, tf_ecx = 0, tf_eax = -962140416, tf_trapno = 12, tf_err = 0, tf_eip = -1072298682, tf_cs = 8, tf_eflags = 66050, tf_esp = -962140416, tf_ss = 0}) at ../../i386/i386/trap.c:448 #11 0xc0160546 in sync_other_counter () at ../../kern/kern_clock.c:741 #12 0xc01605b8 in tco_forward (force=0) at ../../kern/kern_clock.c:760 #13 0xc015fce3 in hardclock (frame=0xff80ffb4) at ../../kern/kern_clock.c:235 #14 0xc025052d in clkintr (frame=Cannot access memory at address 0xff80ffb4. ) at ../../i386/isa/clock.c:216 2nd dump, 2nd stick o'RAM: #0 dumpsys () at ../../kern/kern_shutdown.c:473 #1 0xc016cbd4 in boot (howto=260) at ../../kern/kern_shutdown.c:313 #2 0xc016d005 in panic (fmt=0xc0264f84 "from debugger") at ../../kern/kern_shutdown.c:581 #3 0xc0135f8d in db_panic (addr=2052, have_addr=0, count=-1, modif=0xff80fd8c "") at ../../ddb/db_command.c:435 #4 0xc0135f2b in db_command (last_cmdp=0xc0299744, cmd_table=0xc0299584, aux_cmd_tablep=0xc02b5c58) at ../../ddb/db_command.c:333 #5 0xc0135ff2 in db_command_loop () at ../../ddb/db_command.c:457 #6 0xc01381a3 in db_trap (type=12, code=0) at ../../ddb/db_trap.c:71 #7 0xc023a134 in kdb_trap (type=12, code=0, regs=0xff80fee8) at ../../i386/i386/db_interface.c:158 #8 0xc024e59e in trap_fatal (frame=0xff80fee8, eva=2052) at ../../i386/i386/trap.c:951 #9 0xc024e21d in trap_pfault (frame=0xff80fee8, usermode=0, eva=2052) at ../../i386/i386/trap.c:849 #10 0xc024dd63 in trap (frame={tf_fs = 24, tf_es = 16, tf_ds = 16, tf_edi = 1644167168, tf_esi = -1062691328, tf_ebp = -8323264, tf_isp = -8323308, tf_ebx = -1062691328, tf_edx = 1711277060, tf_ecx = -960954112, tf_eax = 2052, tf_trapno = 12, tf_err = 0, tf_eip = 2052, tf_cs = 8, tf_eflags = 66050, tf_esp = -1072130663, tf_ss = -736729416}) at ../../i386/i386/trap.c:448 #11 0x804 in ?? () #12 0xc01ba827 in in_arpinput (m=0xc0a89e00) at ../../netinet/if_ether.c:709 #13 0xc01ba33e in arpintr () at ../../netinet/if_ether.c:486 -------------------------------------------------------------------- 1st dump, 1st stick o'RAM: #0 dumpsys () at ../../kern/kern_shutdown.c:473 #1 0xc016cbd4 in boot (howto=260) at ../../kern/kern_shutdown.c:313 #2 0xc016d005 in panic (fmt=0xc0264f84 "from debugger") at ../../kern/kern_shutdown.c:581 #3 0xc0135f8d in db_panic (addr=-1071748546, have_addr=0, count=-1, modif=0xd8290c64 "") at ../../ddb/db_command.c:435 #4 0xc0135f2b in db_command (last_cmdp=0xc0299744, cmd_table=0xc0299584, aux_cmd_tablep=0xc02b5c58) at ../../ddb/db_command.c:333 #5 0xc0135ff2 in db_command_loop () at ../../ddb/db_command.c:457 #6 0xc01381a3 in db_trap (type=12, code=0) at ../../ddb/db_trap.c:71 #7 0xc023a134 in kdb_trap (type=12, code=0, regs=0xd8290dc0) at ../../i386/i386/db_interface.c:158 #8 0xc024e59e in trap_fatal (frame=0xd8290dc0, eva=141217160) at ../../i386/i386/trap.c:951 #9 0xc024e21d in trap_pfault (frame=0xd8290dc0, usermode=0, eva=141217160) at ../../i386/i386/trap.c:849 #10 0xc024dd63 in trap (frame={tf_fs = 24, tf_es = 16, tf_ds = 16, tf_edi = -700825152, tf_esi = -1062677760, tf_ebp = -668398076, tf_isp = -668398100, tf_ebx = -1071385936, tf_edx = -1071385936, tf_ecx = 141204568, tf_eax = -1071385928, tf_trapno = 12, tf_err = 0, tf_eip = -1071748546, tf_cs = 8, tf_eflags = 66118, tf_esp = -1062677760, tf_ss = -668397992}) at ../../i386/i386/trap.c:448 #11 0xc01e6a3e in sk_jfree (buf=0xc023f2b8 "%Ü", size=9024) at ../../pci/if_sk.c:870 #12 0xc018c9fd in soreceive (so=0xd63a41c0, psa=0x0, uio=0xd8290ed0, mp0=0x0, controlp=0x0, flagsp=0x0) at ../../kern/uipc_socket.c:870 #13 0xc017f7c1 in soo_read (fp=0xc6d2f140, uio=0xd8290ed0, cred=0xc6d20c00, flags=0, p=0xd3a2c380) at ../../kern/sys_socket.c:68 #14 0xc017bf2b in dofileread (p=0xd3a2c380, fp=0xc6d2f140, fd=0, buf=0x804f440, nbyte=1024, offset=-1, flags=0) at ../../sys/file.h:146 #15 0xc017bdef in read (p=0xd3a2c380, uap=0xd8290f80) at ../../kern/sys_generic.c:117 #16 0xc024e919 in syscall2 (frame={tf_fs = 47, tf_es = 47, tf_ds = 47, tf_edi = 0, tf_esi = -1077937348, tf_ebp = -1077937212, tf_isp = -668397612, tf_ebx = -1077937340, tf_edx = 0, tf_ecx = 0, tf_eax = 3, tf_trapno = 7, tf_err = 2, tf_eip = 672024140, tf_cs = 31, tf_eflags = 663, tf_esp = -1077937400, tf_ss = 47}) at ../../i386/i386/trap.c:1157 #17 0xc023afab in Xint0x80_syscall () #18 0x80493e9 in ?? () #19 0x804bc0f in ?? () #20 0x804bca8 in ?? () #21 0x804bd89 in ?? () #22 0x8049aea in ?? () #23 0x80498e4 in ?? () #24 0x80488d5 in ?? () To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-stable" in the body of the message