Skip site navigation (1)Skip section navigation (2)
Date:      Fri, 22 Aug 2003 00:18:29 +0300
From:      Alexander Motin <mav@alkar.net>
To:        freebsd-smp@freebsd.org
Subject:   5.1-p2 locks up with SMP kernel
Message-ID:  <3F453725.5040800@alkar.net>

next in thread | raw e-mail | index | archive | help
Hi!

I have Dual P4 Xeon 2.4GHz system with Supermicro X5DEI-GG motherboard. 
It based on Serverworks chipset and have 2 em ethernet card and ATI 
video onboard. Also there installed PCI Adaptec 2940 Ultra SCSI adapter 
with 2 HDD. This hardware installed into Supermicro server case with 420 
Watt PSU. AMI Flash BIOS was updated to the last 1.2c version.

When I use UP or SMP 4.8 kernel - all works fine.
When I use UP 5.1-p2 kernel - all works fine.
But when I load SMP 5.1-p2 kernel and try to make heavy system load 
(such as building of mysql from sources by 'make -j 8' command) system 
locks up after about 30 seconds of working. There is no any error 
message on console or in logs when it happends and only way to restart 
system is a reset button.

This problem is not depend on HT enabled or not and ACPI enabled or not 
in BIOS setup and enabling/disabling of the em cards.

Can anybody recommend me how to fix this situation or how to get any 
debug information on this lock up?

I am trying GENERIC kernel config with only two SMP required parameters 
added and 386,486,586 CPU types removed.

Here is my dmesg output with HT and ACPI enabled:

Copyright (c) 1992-2003 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
         The Regents of the University of California. All rights reserved.
FreeBSD 5.1-RELEASE-p2 #1: Thu Aug 21 17:09:01 EEST 2003
     root@atas1.alkar.net:/usr/src/sys/i386/compile/ATAS
Preloaded elf kernel "/boot/kernel/kernel" at 0xc0732000.
Preloaded elf module "/boot/kernel/acpi.ko" at 0xc07322bc.
Timecounter "i8254"  frequency 1193182 Hz
Timecounter "TSC"  frequency 2400107740 Hz
CPU: Intel(R) Xeon(TM) CPU 2.40GHz (2400.11-MHz 686-class CPU)
   Origin = "GenuineIntel"  Id = 0xf25  Stepping = 5
 
Features=0xbfebfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE>
   Hyperthreading: 2 logical CPUs
real memory  = 1073676288 (1023 MB)
avail memory = 1035366400 (987 MB)
Programming 16 pins in IOAPIC #0
IOAPIC #0 intpin 2 -> irq 0
Programming 16 pins in IOAPIC #1
Programming 16 pins in IOAPIC #2
Programming 16 pins in IOAPIC #3
FreeBSD/SMP: Multiprocessor System Detected: 4 CPUs
  cpu0 (BSP): apic id:  0, version: 0x00050014, at 0xfee00000
  cpu1 (AP):  apic id:  1, version: 0x00050014, at 0xfee00000
  cpu2 (AP):  apic id:  6, version: 0x00050014, at 0xfee00000
  cpu3 (AP):  apic id:  7, version: 0x00050014, at 0xfee00000
  io0 (APIC): apic id:  8, version: 0x000f0011, at 0xfec00000
  io1 (APIC): apic id:  9, version: 0x000f0011, at 0xfec01000
  io2 (APIC): apic id: 10, version: 0x000f0011, at 0xfec02000
  io3 (APIC): apic id: 11, version: 0x000f0011, at 0xfec03000
Pentium Pro MTRR support enabled
     ACPI-0660: *** Warning: Type override - [DEB_] had invalid type 
(Integer) for Scope operator, changed to (Scope)
     ACPI-0660: *** Warning: Type override - [MLIB] had invalid type 
(Integer) for Scope operator, changed to (Scope)
     ACPI-0660: *** Warning: Type override - [IO__] had invalid type 
(Integer) for Scope operator, changed to (Scope)
     ACPI-0660: *** Warning: Type override - [DATA] had invalid type 
(String) for Scope operator, changed to (Scope)
     ACPI-0660: *** Warning: Type override - [SIO_] had invalid type 
(String) for Scope operator, changed to (Scope)
     ACPI-0660: *** Warning: Type override - [SB__] had invalid type 
(String) for Scope operator, changed to (Scope)
     ACPI-0660: *** Warning: Type override - [PM__] had invalid type 
(String) for Scope operator, changed to (Scope)
     ACPI-0660: *** Warning: Type override - [ICNT] had invalid type 
(String) for Scope operator, changed to (Scope)
     ACPI-0660: *** Warning: Type override - [ACPI] had invalid type 
(String) for Scope operator, changed to (Scope)
     ACPI-0660: *** Warning: Type override - [IORG] had invalid type 
(String) for Scope operator, changed to (Scope)
     ACPI-0660: *** Warning: Type override - [SB__] had invalid type 
(String) for Scope operator, changed to (Scope)
     ACPI-0660: *** Warning: Type override - [PM__] had invalid type 
(String) for Scope operator, changed to (Scope)
     ACPI-0660: *** Warning: Type override - [SIO_] had invalid type 
(String) for Scope operator, changed to (Scope)
     ACPI-0660: *** Warning: Type override - [PM__] had invalid type 
(String) for Scope operator, changed to (Scope)
     ACPI-0660: *** Warning: Type override - [BIOS] had invalid type 
(Integer) for Scope operator, changed to (Scope)
     ACPI-0660: *** Warning: Type override - [CMOS] had invalid type 
(Integer) for Scope operator, changed to (Scope)
     ACPI-0660: *** Warning: Type override - [KBC_] had invalid type 
(Integer) for Scope operator, changed to (Scope)
     ACPI-0660: *** Warning: Type override - [OEM_] had invalid type 
(Integer) for Scope operator, changed to (Scope)
npx0: <math processor> on motherboard
npx0: INT 16 interface
acpi0: <RCC    GCHE    > on motherboard
acpi0: power button is handled as a fixed feature programming model.
Timecounter "ACPI-safe"  frequency 3579545 Hz
pcibios: BIOS version 2.10
Using $PIR table, 11 entries at 0xc00f4a50
unknown: I/O range not supported
acpi_timer0: <32-bit timer at 3.579545MHz> port 0x508-0x50b on acpi0
acpi_cpu0: <CPU> on acpi0
acpi_cpu1: <CPU> on acpi0
acpi_cpu2: <CPU> on acpi0
acpi_cpu3: <CPU> on acpi0
acpi_cpu4: <CPU> on acpi0
acpi_cpu5: <CPU> on acpi0
acpi_cpu6: <CPU> on acpi0
acpi_cpu7: <CPU> on acpi0
acpi_button0: <Sleep Button> on acpi0
pcib0: <ACPI Host-PCI bridge> port 0xcf8-0xcff on acpi0
pci0: <ACPI PCI bus> on pcib0
IOAPIC #1 intpin 6 -> irq 2
IOAPIC #1 intpin 12 -> irq 9
IOAPIC #1 intpin 10 -> irq 10
ahc0: <Adaptec 2940 Ultra SCSI adapter> port 0xe400-0xe4ff mem 
0xfebfe000-0xfebfefff irq 2 at device 6.0 on pci0
aic7880: Ultra Wide Channel A, SCSI Id=7, 16/253 SCBs
em0: <Intel(R) PRO/1000 Network Connection, Version - 1.5.31> port 
0xd800-0xd83f mem 0xfeb80000-0xfeb9ffff irq 9 at device 8.0 on pci0
em0:  Speed:100 Mbps  Duplex:Full
em1: <Intel(R) PRO/1000 Network Connection, Version - 1.5.31> port 
0xe000-0xe03f mem 0xfeba0000-0xfebbffff irq 10 at device 9.0 on pci0
em1:  Speed:N/A  Duplex:N/A
pci0: <display, VGA> at device 11.0 (no driver attached)
atapci0: <ServerWorks CSB6 UDMA100 controller> port 
0xffa0-0xffaf,0x374-0x377,0x170-0x177,0x3f4-0x3f7,0x1f0-0x1f7 at device 
15.1 on pci0
ata0: at 0x1f0 irq 14 on atapci0
ata1: at 0x170 irq 15 on atapci0
isab0: <PCI-ISA bridge> at device 15.3 on pci0
isa0: <ISA bus> on isab0
atkbdc0: <Keyboard controller (i8042)> port 0x64,0x60 irq 1 on acpi0
atkbd0: <AT Keyboard> irq 1 on atkbdc0
kbd0 at atkbd0
fdc0: cmd 3 failed at out byte 1 of 3
fdc0: cmd 3 failed at out byte 1 of 3
orm0: <Option ROMs> at iomem 0xc8000-0xcefff,0xc0000-0xc7fff on isa0
pmtimer0 on isa0
fdc0: <Enhanced floppy controller (i82077, NE72065 or clone)> at port 
0x3f7,0x3f0-0x3f5 irq 6 drq 2 on isa0
fdc0: FIFO enabled, 8 bytes threshold
ppc0: parallel port not found.
sc0: <System console> at flags 0x100 on isa0
sc0: VGA <16 virtual consoles, flags=0x300>
sio0: configured irq 4 not in bitmap of probed irqs 0
sio0: port may not be enabled
sio0 at port 0x3f8-0x3ff irq 4 flags 0x10 on isa0
sio0: type 8250 or not responding
sio1: configured irq 3 not in bitmap of probed irqs 0
sio1: port may not be enabled
vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
APIC_IO: Testing 8254 interrupt delivery
APIC_IO: Broken MP table detected: 8254 is not connected to IOAPIC #0 
intpin 2
APIC_IO: routing 8254 via 8259 and IOAPIC #0 intpin 0
Timecounters tick every 10.000 msec
Waiting 15 seconds for SCSI devices to settle
acpi_cpu: throttling enabled, 16 steps (100% to 6.2%), currently 100.0%
da0 at ahc0 bus 0 target 1 lun 0
da0: <IBM DDYS-T18350N S93E> Fixed Direct Access SCSI-3 device
da0: 40.000MB/s transfers (20.000MHz, offset 8, 16bit), Tagged Queueing 
Enabled
da0: 17501MB (35843670 512 byte sectors: 255H 63S/T 2231C)
da1 at ahc0 bus 0 target 2 lun 0
da1: <IBM IC35L036UWD210-0 S5BS> Fixed Direct Access SCSI-3 device
da1: 40.000MB/s transfers (20.000MHz, offset 8, 16bit), Tagged Queueing 
Enabled
da1: 35003MB (71687340 512 byte sectors: 255H 63S/T 4462C)
SMP: AP CPU #1 Launched!
SMP: AP CPU #2 Launched!
SMP: AP CPU #3 Launched!
Mounting root from ufs:/dev/da0s1a
WARNING: / was not properly dismounted
/: mount pending error: blocks 524 files 82
ipfw2 initialized, divert disabled, rule-based forwarding enabled, 
default to deny, logging disabled

-- 
Alexander Motin



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?3F453725.5040800>