From owner-freebsd-stable@FreeBSD.ORG Sun Mar 21 10:00:42 2004 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 0C4A916A4CE for ; Sun, 21 Mar 2004 10:00:42 -0800 (PST) Received: from smtp2.mc.surewest.net (smtp2.mc.surewest.net [66.60.130.51]) by mx1.FreeBSD.org (Postfix) with SMTP id B5A3543D3F for ; Sun, 21 Mar 2004 10:00:41 -0800 (PST) (envelope-from dislists@updegrove.net) Received: (qmail 6742 invoked from network); 21 Mar 2004 18:00:40 -0000 Received: from unknown (HELO updegrove.net) (64.30.97.117) by smtp2.mc.surewest.net with SMTP; 21 Mar 2004 18:00:40 -0000 Received: (qmail 33752 invoked by uid 98); 21 Mar 2004 18:00:43 -0000 Received: from dislists@updegrove.net by smeagol.purgatory by uid 1008 with qmail-scanner-1.20 Clear:RC:1(192.168.0.2):. Processed in 0.113529 secs); 21 Mar 2004 18:00:43 -0000 X-Qmail-Scanner-Mail-From: dislists@updegrove.net via smeagol.purgatory X-Qmail-Scanner: 1.20 (Clear:RC:1(192.168.0.2):. Processed in 0.113529 secs) Received: from unknown (HELO updegrove.net) (192.168.0.2) by updegrove.net with SMTP; 21 Mar 2004 18:00:42 -0000 Message-ID: <405DD931.9080303@updegrove.net> Date: Sun, 21 Mar 2004 10:04:33 -0800 From: Rick Updegrove User-Agent: Mozilla/5.0 (Windows; U; Windows NT 5.0; en-US; rv:1.6) Gecko/20040113 X-Accept-Language: en-us, en MIME-Version: 1.0 To: freebsd-stable@freebsd.org References: <005401c3c575$674d4ab0$41c3c3cf@office.sihope.com> In-Reply-To: <005401c3c575$674d4ab0$41c3c3cf@office.sihope.com> Content-Type: text/plain; charset=us-ascii; format=flowed Content-Transfer-Encoding: 7bit X-TST: smtp2.mc.surewest.net SNWK2 0.31-33 ip=64.30.97.117 Subject: Re: SMP forcing crash/reboot in 4.9-STABLE X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.1 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 21 Mar 2004 18:00:42 -0000 A 4.8-STABLE machine I have been running with no problems for over 130 days straight uptime is now having unexplained reboots. They are not every day, or predictable, but they are happening. It is a low traffic qmail-scanner machine (7k a day) and I upgraded due to http://www.freebsd.org/releases/4.9R/errata.html Now I am sort of wishing I did not : ) I lost all the uptime and now the unexplained rebooting... I hesitate reporting this because most people point their fingers at the hardware. I am tempted to abandon this machine for another but if anyone is interested in taking a look please advise. Currently, I don't have the ability to #/etc/rc.conf #rebooting dumpdev=YES savecore=YES dumpdir="/var/crash" Is there anything else I should do? Right now I do not have the ability to attach a serial console to the crashing system and set the system to serial console. And even if I did have physical access I am not sure how to do that exactly... Is there another way to accomplish the debugging of this? I have been running FreeBSD so long with no problems I am sort of rusty at tracking them down, especially the elusive ones. So please point me in the right direction. Thanks! Rick P.S. dmesg follows Copyright (c) 1992-2003 The FreeBSD Project. Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 The Regents of the University of California. All rights reserved. FreeBSD 4.9-STABLE #0: Wed Mar 3 00:02:36 PST 2004 root@govmail.ca.gov:/usr/obj/usr/src/sys/SMP Timecounter "i8254" frequency 1193182 Hz CPU: Pentium III/Pentium III Xeon/Celeron (499.15-MHz 686-class CPU) Origin = "GenuineIntel" Id = 0x673 Stepping = 3 Features=0x387fbff real memory = 536870912 (524288K bytes) avail memory = 519507968 (507332K bytes) Programming 24 pins in IOAPIC #0 IOAPIC #0 intpin 2 -> irq 0 FreeBSD/SMP: Multiprocessor motherboard: 2 CPUs cpu0 (BSP): apic id: 1, version: 0x00040011, at 0xfee00000 cpu1 (AP): apic id: 0, version: 0x00040011, at 0xfee00000 io0 (APIC): apic id: 2, version: 0x00170011, at 0xfec00000 Preloaded elf kernel "kernel" at 0xc0329000. Pentium Pro MTRR support enabled md0: Malloc disk Using $PIR table, 14 entries at 0xc00fdee0 npx0: on motherboard npx0: INT 16 interface pcib0: on motherboard IOAPIC #0 intpin 19 -> irq 2 IOAPIC #0 intpin 17 -> irq 16 pci0: on pcib0 isab0: at device 4.0 on pci0 isa0: on isab0 atapci0: port 0xfcd0-0xfcdf at device 4.1 on pci0 ata0: at 0x1f0 irq 14 on atapci0 ata1: at 0x170 irq 15 on atapci0 pci0: at 4.2 irq 2 Timecounter "PIIX" frequency 3579545 Hz chip1: port 0x2180-0x218f at device 4.3 on pci0 pcib1: at device 7.0 on pci0 IOAPIC #0 intpin 16 -> irq 17 pci1: on pcib1 ahc0: port 0xe800-0xe8ff mem 0xfebfe000-0xfebfefff irq 17 at device 4.0 on pci1 aic7880: Ultra Wide Channel A, SCSI Id=7, 16/253 SCBs pci1: (vendor=0x1000, dev=0x000c) at 7.0 irq 18 amr0: mem 0xf0000000-0xf7ffffff irq 16 at device 7.1 on pci0 amr0: Firmware D.02.05, BIOS B.01.04, 16MB RAM pcib2: at device 8.0 on pci0 pci2: on pcib2 fxp0: port 0xdce0-0xdcff mem 0xfe900000-0xfe9fffff,0xefffe000-0xefffefff irq 16 at device 2.0 on pci2 fxp0: Ethernet address 00:90:27:b7:09:76 inphy0: on miibus0 inphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto pci0: (vendor=0x103c, dev=0x10c1) at 11.0 pci0: at 13.0 orm0: