From owner-freebsd-smp@FreeBSD.ORG Thu Aug 21 14:18:38 2003
Return-Path:
Delivered-To: freebsd-smp@freebsd.org
Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125])
 by hub.freebsd.org (Postfix) with ESMTP id C3C1C16A4BF
 for ; Thu, 21 Aug 2003 14:18:38 -0700 (PDT)
Received: from mail.alkar.net (mail.alkar.net [195.248.191.95])
 by mx1.FreeBSD.org (Postfix) with ESMTP id D349143FDF
 for ; Thu, 21 Aug 2003 14:18:34 -0700 (PDT)
 (envelope-from mav@alkar.net)
Received: from [212.86.233.2] (HELO alkar.net)
 by mail.alkar.net (CommuniGate Pro SMTP 4.1) with ESMTP id 98514618
 for freebsd-smp@freebsd.org; Fri, 22 Aug 2003 00:18:29 +0300
Message-ID: <3F453725.5040800@alkar.net>
Date: Fri, 22 Aug 2003 00:18:29 +0300
From: Alexander Motin
User-Agent: Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.4) Gecko/20030624
X-Accept-Language: ru, en-us, en
MIME-Version: 1.0
To: freebsd-smp@freebsd.org
Content-Type: text/plain; charset=us-ascii; format=flowed
Content-Transfer-Encoding: 7bit
Subject: 5.1-p2 locks up with SMP kernel
X-BeenThere: freebsd-smp@freebsd.org
X-Mailman-Version: 2.1.1
Precedence: list
List-Id: FreeBSD SMP implementation group
List-Unsubscribe: ,
List-Archive:
List-Post:
List-Help:
List-Subscribe: ,
X-List-Received-Date: Thu, 21 Aug 2003 21:18:39 -0000

Hi!

I have a dual P4 Xeon 2.4GHz system with a Supermicro X5DEI-GG motherboard. It is based on a Serverworks chipset and has two em Ethernet cards and ATI video onboard. A PCI Adaptec 2940 Ultra SCSI adapter with 2 HDDs is also installed. The hardware is installed in a Supermicro server case with a 420 Watt PSU. The AMI Flash BIOS has been updated to the latest version, 1.2c.

With a UP or SMP 4.8 kernel, everything works fine. With a UP 5.1-p2 kernel, everything works fine. But when I boot an SMP 5.1-p2 kernel and put the system under heavy load (such as building mysql from source with 'make -j 8'), the system locks up after about 30 seconds.
There is no error message on the console or in the logs when it happens, and the only way to restart the system is the reset button. The problem does not depend on whether HT or ACPI is enabled in the BIOS setup, or on whether the em cards are enabled or disabled.

Can anybody recommend how to fix this, or how to get any debug information about this lockup?

I am using the GENERIC kernel config with only the two parameters required for SMP added and the 386, 486, and 586 CPU types removed.

Here is my dmesg output with HT and ACPI enabled:

Copyright (c) 1992-2003 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
 The Regents of the University of California. All rights reserved.
FreeBSD 5.1-RELEASE-p2 #1: Thu Aug 21 17:09:01 EEST 2003
 root@atas1.alkar.net:/usr/src/sys/i386/compile/ATAS
Preloaded elf kernel "/boot/kernel/kernel" at 0xc0732000.
Preloaded elf module "/boot/kernel/acpi.ko" at 0xc07322bc.
Timecounter "i8254" frequency 1193182 Hz
Timecounter "TSC" frequency 2400107740 Hz
CPU: Intel(R) Xeon(TM) CPU 2.40GHz (2400.11-MHz 686-class CPU)
 Origin = "GenuineIntel" Id = 0xf25 Stepping = 5
 Features=0xbfebfbff
 Hyperthreading: 2 logical CPUs
real memory = 1073676288 (1023 MB)
avail memory = 1035366400 (987 MB)
Programming 16 pins in IOAPIC #0
IOAPIC #0 intpin 2 -> irq 0
Programming 16 pins in IOAPIC #1
Programming 16 pins in IOAPIC #2
Programming 16 pins in IOAPIC #3
FreeBSD/SMP: Multiprocessor System Detected: 4 CPUs
 cpu0 (BSP): apic id: 0, version: 0x00050014, at 0xfee00000
 cpu1 (AP): apic id: 1, version: 0x00050014, at 0xfee00000
 cpu2 (AP): apic id: 6, version: 0x00050014, at 0xfee00000
 cpu3 (AP): apic id: 7, version: 0x00050014, at 0xfee00000
 io0 (APIC): apic id: 8, version: 0x000f0011, at 0xfec00000
 io1 (APIC): apic id: 9, version: 0x000f0011, at 0xfec01000
 io2 (APIC): apic id: 10, version: 0x000f0011, at 0xfec02000
 io3 (APIC): apic id: 11, version: 0x000f0011, at 0xfec03000
Pentium Pro MTRR support enabled
ACPI-0660: *** Warning: Type override - [DEB_] had invalid type (Integer) for Scope operator, changed to (Scope)
ACPI-0660: *** Warning: Type override - [MLIB] had invalid type (Integer) for Scope operator, changed to (Scope)
ACPI-0660: *** Warning: Type override - [IO__] had invalid type (Integer) for Scope operator, changed to (Scope)
ACPI-0660: *** Warning: Type override - [DATA] had invalid type (String) for Scope operator, changed to (Scope)
ACPI-0660: *** Warning: Type override - [SIO_] had invalid type (String) for Scope operator, changed to (Scope)
ACPI-0660: *** Warning: Type override - [SB__] had invalid type (String) for Scope operator, changed to (Scope)
ACPI-0660: *** Warning: Type override - [PM__] had invalid type (String) for Scope operator, changed to (Scope)
ACPI-0660: *** Warning: Type override - [ICNT] had invalid type (String) for Scope operator, changed to (Scope)
ACPI-0660: *** Warning: Type override - [ACPI] had invalid type (String) for Scope operator, changed to (Scope)
ACPI-0660: *** Warning: Type override - [IORG] had invalid type (String) for Scope operator, changed to (Scope)
ACPI-0660: *** Warning: Type override - [SB__] had invalid type (String) for Scope operator, changed to (Scope)
ACPI-0660: *** Warning: Type override - [PM__] had invalid type (String) for Scope operator, changed to (Scope)
ACPI-0660: *** Warning: Type override - [SIO_] had invalid type (String) for Scope operator, changed to (Scope)
ACPI-0660: *** Warning: Type override - [PM__] had invalid type (String) for Scope operator, changed to (Scope)
ACPI-0660: *** Warning: Type override - [BIOS] had invalid type (Integer) for Scope operator, changed to (Scope)
ACPI-0660: *** Warning: Type override - [CMOS] had invalid type (Integer) for Scope operator, changed to (Scope)
ACPI-0660: *** Warning: Type override - [KBC_] had invalid type (Integer) for Scope operator, changed to (Scope)
ACPI-0660: *** Warning: Type override - [OEM_] had invalid type (Integer) for Scope operator, changed to (Scope)
npx0: on motherboard
npx0: INT 16 interface
acpi0: on motherboard
acpi0: power button is handled as a fixed feature programming model.
Timecounter "ACPI-safe" frequency 3579545 Hz
pcibios: BIOS version 2.10
Using $PIR table, 11 entries at 0xc00f4a50
unknown: I/O range not supported
acpi_timer0: <32-bit timer at 3.579545MHz> port 0x508-0x50b on acpi0
acpi_cpu0: on acpi0
acpi_cpu1: on acpi0
acpi_cpu2: on acpi0
acpi_cpu3: on acpi0
acpi_cpu4: on acpi0
acpi_cpu5: on acpi0
acpi_cpu6: on acpi0
acpi_cpu7: on acpi0
acpi_button0: on acpi0
pcib0: port 0xcf8-0xcff on acpi0
pci0: on pcib0
IOAPIC #1 intpin 6 -> irq 2
IOAPIC #1 intpin 12 -> irq 9
IOAPIC #1 intpin 10 -> irq 10
ahc0: port 0xe400-0xe4ff mem 0xfebfe000-0xfebfefff irq 2 at device 6.0 on pci0
aic7880: Ultra Wide Channel A, SCSI Id=7, 16/253 SCBs
em0: port 0xd800-0xd83f mem 0xfeb80000-0xfeb9ffff irq 9 at device 8.0 on pci0
em0: Speed:100 Mbps Duplex:Full
em1: port 0xe000-0xe03f mem 0xfeba0000-0xfebbffff irq 10 at device 9.0 on pci0
em1: Speed:N/A Duplex:N/A
pci0: at device 11.0 (no driver attached)
atapci0: port 0xffa0-0xffaf,0x374-0x377,0x170-0x177,0x3f4-0x3f7,0x1f0-0x1f7 at device 15.1 on pci0
ata0: at 0x1f0 irq 14 on atapci0
ata1: at 0x170 irq 15 on atapci0
isab0: at device 15.3 on pci0
isa0: on isab0
atkbdc0: port 0x64,0x60 irq 1 on acpi0
atkbd0: irq 1 on atkbdc0
kbd0 at atkbd0
fdc0: cmd 3 failed at out byte 1 of 3
fdc0: cmd 3 failed at out byte 1 of 3
orm0: