From owner-freebsd-smp Sun Apr 29 14:42:31 2001
Delivered-To: freebsd-smp@freebsd.org
Received: from klima.physik.uni-mainz.de (klima.Physik.Uni-Mainz.DE [134.93.180.162])
	by hub.freebsd.org (Postfix) with ESMTP id CFDA637B423;
	Sun, 29 Apr 2001 14:42:14 -0700 (PDT)
	(envelope-from ohartman@klima.physik.uni-mainz.de)
Received: from klima.Physik.Uni-Mainz.DE (Linux@klima.Physik.Uni-Mainz.DE [134.93.180.162])
	by klima.physik.uni-mainz.de (8.11.3/8.11.3) with ESMTP id f3TLgDw85926;
	Sun, 29 Apr 2001 23:42:14 +0200 (CEST)
	(envelope-from ohartman@klima.physik.uni-mainz.de)
Date: Sun, 29 Apr 2001 23:42:13 +0200 (CEST)
From: "Hartmann, O."
To:
Cc:
Subject: Spontanous reboot of SMP system and FBSD 4.3
Message-ID:
MIME-Version: 1.0
Content-Type: TEXT/PLAIN; charset=US-ASCII
Sender: owner-freebsd-smp@FreeBSD.ORG
Precedence: bulk
X-Loop: FreeBSD.org

Dear Sirs.

One of our server systems, a dual PIII machine based on a TYAN Thunder 2500
(ServerWorks Set III HE) with an AMI Enterprise 1600 RAID controller and 2GB
RAM, seems to reboot spontaneously without any reason. This is the first time
I have seen this on a kernel without APM enabled; we had previously found APM
in the kernel to be a cause of spontaneous reboots on boards without APM.

At the time the machine rebooted itself there was no heavy load and no
extraordinary activity. I noticed the reboot this morning while checking the
logs.

Several days ago someone asked on the list about SMP stability issues on
several architectures, and I responded that our TYAN Thunder 2500 based
machines have run for a long time. I think I have to revise that statement.

I hope it is a simple problem with the kernel config, because the hardware,
power supplies and UPS system we use are top quality and have shown no
problems before.

Does anyone know whether the kernel options AUTO_EOI1 and AUTO_EOI2 have a
serious impact on stability?
On this machine we configured the kernel with both options enabled and have
had no problems within the last 9 months. I know that many system boards do
not work with both options set, but the TYAN Thunder 2500 seems to. Could
this nevertheless be the reason for the spontaneous reboots?

Please help; all hints and tips are appreciated. I add the dmesg output of
the system here:

Copyright (c) 1992-2001 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
	The Regents of the University of California. All rights reserved.
FreeBSD 4.3-STABLE #74: Thu Apr 26 21:25:13 CEST 2001
    root@atmos.physik.uni-mainz.de:/usr/obj/usr/src/sys/ATMOS
Timecounter "i8254" frequency 1193094 Hz
CPU: Pentium III/Pentium III Xeon/Celeron (868.57-MHz 686-class CPU)
  Origin = "GenuineIntel" Id = 0x683 Stepping = 3
  Features=0x387fbff
real memory = 2147483648 (2097152K bytes)
avail memory = 2087288832 (2038368K bytes)
Programming 16 pins in IOAPIC #0
IOAPIC #0 intpin 2 -> irq 0
Programming 16 pins in IOAPIC #1
FreeBSD/SMP: Multiprocessor motherboard
 cpu0 (BSP): apic id: 1, version: 0x00040011, at 0xfee00000
 cpu1 (AP): apic id: 0, version: 0x00040011, at 0xfee00000
 io0 (APIC): apic id: 2, version: 0x000f0011, at 0xfec00000
 io1 (APIC): apic id: 3, version: 0x000f0011, at 0xfec01000
Preloaded elf kernel "kernel" at 0xc03d8000.
Pentium Pro MTRR support enabled
npx0: on motherboard
npx0: INT 16 interface
pcib0: on motherboard
IOAPIC #1 intpin 13 -> irq 2
IOAPIC #1 intpin 12 -> irq 16
IOAPIC #1 intpin 14 -> irq 17
IOAPIC #1 intpin 7 -> irq 18
IOAPIC #1 intpin 0 -> irq 19
pci0: on pcib0
pcib3: at device 0.1 on pci0
IOAPIC #1 intpin 1 -> irq 20
pci1: on pcib3
pci1: at 0.0 irq 20
sym0: <896> port 0xf800-0xf8ff mem 0xfeafe000-0xfeafffff,0xfeafac00-0xfeafafff irq 2 at device 1.0 on pci0
sym0: Symbios NVRAM, ID 7, Fast-40, LVD, parity checking
sym0: open drain IRQ line driver, using on-chip SRAM
sym0: using LOAD/STORE-based firmware.
sym0: handling phase mismatch from SCRIPTS.
sym1: <896> port 0xf400-0xf4ff mem 0xfeafc000-0xfeafdfff,0xfeafa800-0xfeafabff irq 16 at device 1.1 on pci0
sym1: Symbios NVRAM, ID 7, Fast-40, SE, parity checking
sym1: open drain IRQ line driver, using on-chip SRAM
sym1: using LOAD/STORE-based firmware.
sym1: handling phase mismatch from SCRIPTS.
pcm0: port 0xfcc0-0xfcff irq 17 at device 2.0 on pci0
pcib5: at device 3.0 on pci0
IOAPIC #1 intpin 2 -> irq 21
pci2: on pcib5
pcib6: at device 0.0 on pci2
pci3: on pcib6
amr0: mem 0xf4000000-0xf7ffffff irq 19 at device 0.0 on pci3
amr0: Firmware C158, BIOS 3.11, 64MB RAM
pci2: (vendor=0x1077, dev=0x1216) at 1.0 irq 20
pci2: (vendor=0x1077, dev=0x1216) at 2.0 irq 21
fxp0: port 0xfc40-0xfc7f mem 0xfe900000-0xfe9fffff,0xfeaf9000-0xfeaf9fff irq 18 at device 7.0 on pci0
fxp0: Ethernet address 00:e0:81:00:f0:d7
isab0: at device 15.0 on pci0
isa0: on isab0
pci0: at 15.1
pci0: at 15.2 irq 19
pcib1: on motherboard
pci4: on pcib1
pcib2: on motherboard
pci5: on pcib2
pcib4: on motherboard
pci6: on pcib4
atkbdc0: at port 0x60,0x64 on isa0
atkbd0: irq 1 on atkbdc0
kbd0 at atkbd0
psm0: irq 12 on atkbdc0
psm0: model IntelliMouse, device ID 3
vga0: at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
sc0: on isa0
sc0: VGA <8 virtual consoles, flags=0x200>
fdc0: at port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on isa0
fdc0: FIFO enabled, 8 bytes threshold
fd0: <1440-KB 3.5" drive> on fdc0 drive 0
sio0 at port 0x3f8-0x3ff irq 4 flags 0x10 on isa0
sio0: type 16550A
sio1 at port 0x2f8-0x2ff irq 3 flags 0x10 on isa0
sio1: type 16550A
ppc0: at port 0x378-0x37f irq 7 drq 1 flags 0x8 on isa0
ppc0: SMC-like chipset (ECP-only) in ECP mode
ppc0: FIFO with 16/16/8 bytes threshold
lpt0: on ppbus0
lpt0: Interrupt-driven port
APIC_IO: Testing 8254 interrupt delivery
APIC_IO: Broken MP table detected: 8254 is not connected to IOAPIC #0 intpin 2
APIC_IO: routing 8254 via 8259 and IOAPIC #0 intpin 0
DUMMYNET initialized (010124)
IP packet filtering initialized, divert enabled, rule-based forwarding enabled, default to deny, unlimited logging
IPsec: Initialized Security Association Processing.
Waiting 4 seconds for SCSI devices to settle
(noperiph:sym0:0:-1:-1): SCSI BUS reset delivered.
(noperiph:sym1:0:-1:-1): SCSI BUS reset delivered.
amrd0: on amr0
amrd0: 245014MB (501788672 sectors) RAID 5 (optimal)
SMP: AP CPU #1 Launched!
sa0 at sym1 bus 0 target 5 lun 0
sa0: Removable Sequential Access SCSI-2 device
sa0: 40.000MB/s transfers (20.000MHz, offset 31, 16bit)
no devsw (majdev=0 bootdev=0xa0200000)
Mounting root from ufs:/dev/amrd0s1a
cd0 at sym1 bus 0 target 3 lun 0
cd0: Removable CD-ROM SCSI-2 device
cd0: 20.000MB/s transfers (20.000MHz, offset 16)
cd0: cd present [329963 x 2048 byte records]
ch0 at sym1 bus 0 target 5 lun 1
ch0: Removable Changer SCSI-2 device
ch0: 40.000MB/s transfers (20.000MHz, offset 31, 16bit)
ch0: 6 slots, 1 drive, 0 pickers, 0 portals
WARNING: / was not properly dismounted
link_elf: symbol splash_register undefined
fxp0: promiscuous mode enabled
arp: runt packet
arp: runt packet

Thanks in advance,
Oliver

--
Best regards,
O. Hartmann
ohartman@klima.physik.uni-mainz.de
----------------------------------------------------------------
IT-Administration des Institut fuer Physik der Atmosphaere (IPA)
----------------------------------------------------------------
Johannes Gutenberg Universitaet Mainz
Becherweg 21
55099 Mainz

Tel: +496131/3924662 (machine room)
Tel: +496131/3924144
FAX: +496131/3923532

To Unsubscribe: send mail to majordomo@FreeBSD.org
with "unsubscribe freebsd-smp" in the body of the message