From owner-freebsd-smp@FreeBSD.ORG Sun Jul 27 05:50:17 2003 Return-Path: Delivered-To: freebsd-smp@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 4ADD737B401; Sun, 27 Jul 2003 05:50:17 -0700 (PDT) Received: from alfred.oau.org (2.208.27.24.cfl.rr.com [24.27.208.2]) by mx1.FreeBSD.org (Postfix) with ESMTP id 6868D43FA3; Sun, 27 Jul 2003 05:50:15 -0700 (PDT) (envelope-from elliot@alfred.oau.org) Received: from alfred.oau.org (localhost [127.0.0.1]) by alfred.oau.org (8.12.8p1/8.12.8) with ESMTP id h6RCoEWb000337 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO); Sun, 27 Jul 2003 08:50:14 -0400 (EDT) (envelope-from ebd@oau.org) Received: (from elliot@localhost) by alfred.oau.org (8.12.8p1/8.12.8/Submit) id h6RCoD1a000336; Sun, 27 Jul 2003 08:50:13 -0400 (EDT) Message-Id: <200307271250.h6RCoD1a000336@alfred.oau.org> To: freebsd-stable@freebsd.org, freebsd-smp@freebsd.org Date: Sun, 27 Jul 2003 08:50:13 -0400 (EDT) Sender: elliot@alfred.oau.org From: Elliot Dierksen X-Mailer: ELM [version 2.4ME+ PL99f (25)] MIME-Version: 1.0 Content-Type: multipart/mixed; boundary=ELM1059310213-252-0_ Content-Transfer-Encoding: 7bit Subject: SMP Problems with 4.8-RELEASE X-BeenThere: freebsd-smp@freebsd.org X-Mailman-Version: 2.1.1 Precedence: list List-Id: FreeBSD SMP implementation group List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 27 Jul 2003 12:50:17 -0000 --ELM1059310213-252-0_ Content-Transfer-Encoding: 7bit Content-Type: text/plain; charset=US-ASCII Hello, I have the following hardware configuration. Tyan 2466N motherboard, 1.03 BIOS, 2 Athlon-MP 1800+ CPUs, 512MB registered ECC DRAM, Onboard 3Com NIC (xl type) Radeon 8500 AGP video card Adaptec 3210S SCSI RAID controller Comtrol RocketPort 8 port PCI serial card I have started to experience some odd system lockups in the past couple months, and it seems to be getting worse. I particular, this seems to happen when the SCSI controller is very busy. I have a file system where I store dumps of the other file systems. On a weekly basis, I run these dumps. What has started happening is the system will lock up completely during one of the dumps. When this happens, the system is totally unresponsive. Even at the console, it is dead to the world. However, the status lights on the RAID controller seem to indicate that it is happy. The system has to be power cycled at that point. I have been unable to generate any debug information to help chase this issue. I have also had some issues with mozilla locking up and it could not be killed. When I exited from X, the system would lock as described earlier. As a desperate measure to try and get a backup, I built and installed a single CPU kernel. That seems to be resolving the problem. However, it would be nice to bring the 2nd CPU back on line. I looked through all the BIOS options, and I don't see anything specifying the MPS version. I would be happy to assist in troubleshooting, but I really don't know which direction to go at the moment. I have attached both kernel configuration files and dmesg output from both during the boot process. Any suggestions??? Thanx, EBD -- Elliot Dierksen "They that can give up essential liberty to obtain a little temporary safety deserve neither liberty nor safety" -- Benjamin Franklin Home) ebd@oau.org (407) 850-9760 --ELM1059310213-252-0_ Content-Transfer-Encoding: 7bit Content-Type: text/plain Content-Disposition: attachment; filename=AMD Content-Description: # # AMD -- Generic kernel configuration file for FreeBSD/i386 # # For more information on this file, please read the handbook section on # Kernel Configuration Files: # # http://www.FreeBSD.org/handbook/kernelconfig-config.html # # The handbook is also available locally in /usr/share/doc/handbook # if you've installed the doc distribution, otherwise always see the # FreeBSD World Wide Web server (http://www.FreeBSD.org/) for the # latest information. # # An exhaustive list of options and more detailed explanations of the # device lines is also present in the ./LINT configuration file. If you are # in doubt as to the purpose or necessity of a line, check first in LINT. # # $FreeBSD: src/sys/i386/conf/AMD,v 1.246.2.38 2002/01/25 17:41:40 murray Exp $ machine i386 cpu I686_CPU ident AMD maxusers 0 #makeoptions DEBUG=-g #Build kernel with gdb(1) debug symbols options MATH_EMULATE #Support for x87 emulation options INET #InterNETworking options FFS #Berkeley Fast Filesystem options FFS_ROOT #FFS usable as root device [keep this!] options SOFTUPDATES #Enable FFS soft updates support options UFS_DIRHASH #Improve performance on big directories options MFS #Memory Filesystem options MD_ROOT #MD is a potential root device options NFS #Network Filesystem options NFS_ROOT #NFS usable as root device, NFS required options MSDOSFS #MSDOS Filesystem options CD9660 #ISO 9660 Filesystem options CD9660_ROOT #CD-ROM usable as root, CD9660 required options PROCFS #Process filesystem options COMPAT_43 #Compatible with BSD 4.3 [KEEP THIS!] options SCSI_DELAY=5000 #Delay (in ms) before probing SCSI options UCONSOLE #Allow users to grab the console options USERCONFIG #boot -c editor options VISUAL_USERCONFIG #visual boot -c editor options KTRACE #ktrace(1) support options SYSVSHM #SYSV-style shared memory options SYSVMSG #SYSV-style message queues options SYSVSEM #SYSV-style semaphores options P1003_1B #Posix P1003_1B real-time extensions options _KPOSIX_PRIORITY_SCHEDULING options ICMP_BANDLIM #Rate limit bad replies options KBD_INSTALL_CDEV # install a CDEV entry in /dev device isa device pci #device agp # Floppy drives device fdc0 at isa? port IO_FD1 irq 6 drq 2 device fd0 at fdc0 drive 0 device fd1 at fdc0 drive 1 # ATA and ATAPI devices device ata0 at isa? port IO_WD1 irq 14 device ata1 at isa? port IO_WD2 irq 15 device ata device atadisk # ATA disk drives device atapicd # ATAPI CDROM drives device atapifd # ATAPI floppy drives device atapist # ATAPI tape drives options ATA_STATIC_ID #Static device numbering # SCSI Controllers device ahc # AHA2940 and onboard AIC7xxx devices device sym # NCR/Symbios Logic (newer chipsets) options SYM_SETUP_LP_PROBE_MAP=0x40 # Allow ncr to attach legacy NCR devices when # both sym and ncr are configured # SCSI peripherals device scbus # SCSI bus (required) device da # Direct Access (disks) device sa # Sequential Access (tape etc) device cd # CD device pass # Passthrough device (direct SCSI access) # RAID controllers interfaced to the SCSI subsystem device asr # DPT SmartRAID V, VI and Adaptec SCSI RAID # atkbdc0 controls both the keyboard and the PS/2 mouse device atkbdc0 at isa? port IO_KBD device atkbd0 at atkbdc? irq 1 flags 0x1 device psm0 at atkbdc? irq 12 flags 0x1100 device vga0 at isa? # splash screen/screen saver pseudo-device splash # syscons is the default console driver, resembling an SCO console device sc0 at isa? flags 0x100 # Floating point support - do not disable. device npx0 at nexus? port IO_NPX irq 13 # Power management support (see LINT for more options) device apm0 at nexus? disable flags 0x20 # Advanced Power Management # Serial (COM) ports device sio0 at isa? port IO_COM1 flags 0x10 irq 4 device sio1 at isa? port IO_COM2 irq 3 device sio2 at isa? disable port IO_COM3 irq 5 device sio3 at isa? disable port IO_COM4 irq 9 # Parallel port device ppc0 at isa? irq 7 device ppbus # Parallel port bus (required) device lpt # Printer # PCI Ethernet NICs that use the common MII bus controller code. # NOTE: Be sure to keep the 'device miibus' line in order to use these NICs! device miibus # MII bus support device fxp # Intel EtherExpress PRO/100B (82557, 82558) device xl # 3Com 3c90x (``Boomerang'', ``Cyclone'') device em # Pseudo devices - the number indicates how many units to allocate. pseudo-device loop # Network loopback pseudo-device ether # Ethernet support pseudo-device tun # Packet tunnel. pseudo-device pty # Pseudo-ttys (telnet etc) pseudo-device md # Memory "disks" # The `bpf' pseudo-device enables the Berkeley Packet Filter. # Be aware of the administrative consequences of enabling this! pseudo-device bpf #Berkeley packet filter device smbus device smb device amdpm device rp device adv #options PSM_DEBUG=1 #options KBD_MAXRETRY=4 #options KBD_MAXWAIT=6 #options KBD_RESETDELAY=2000 options INCLUDE_CONFIG_FILE # Include this file in kernel --ELM1059310213-252-0_ Content-Transfer-Encoding: 7bit Content-Type: text/plain Content-Disposition: attachment; filename=dmesg.txt Content-Description: Copyright (c) 1992-2003 The FreeBSD Project. Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 The Regents of the University of California. All rights reserved. FreeBSD 4.8-RELEASE #0: Sat Jul 26 19:59:12 EDT 2003 root@alfred.oau.org:/usr/obj/usr/src/sys/AMD Calibrating clock(s) ... TSC clock: 1533071659 Hz, i8254 clock: 1192936 Hz CLK_USE_I8254_CALIBRATION not specified - using default frequency Timecounter "i8254" frequency 1193182 Hz CLK_USE_TSC_CALIBRATION not specified - using old calibration method CPU: AMD Athlon(tm) MP 1800+ (1533.40-MHz 686-class CPU) Origin = "AuthenticAMD" Id = 0x662 Stepping = 2 Features=0x383fbff AMD Features=0xc0480000 Data TLB: 32 entries, fully associative Instruction TLB: 16 entries, fully associative L1 data cache: 64 kbytes, 64 bytes/line, 1 lines/tag, 2-way associative L1 instruction cache: 64 kbytes, 64 bytes/line, 1 lines/tag, 2-way associative L2 internal cache: 256 kbytes, 64 bytes/line, 1 lines/tag, 8-way associative real memory = 536870912 (524288K bytes) Physical memory chunk(s): 0x00001000 - 0x0009dfff, 643072 bytes (157 pages) 0x00390000 - 0x1fff7fff, 533102592 bytes (130152 pages) config> di ata0 config> di ata1 config> di sio1 config> en apm0 config> #flags psm0 0x1100 Invalid command or syntax. Type `?' for help. config> flags psm0 0x0100 config> flags atkbd0 0x0 config> q avail memory = 518905856 (506744K bytes) bios32: Found BIOS32 Service Directory header at 0xc00f7100 bios32: Entry = 0xfd650 (c00fd650) Rev = 0 Len = 1 pcibios: PCI BIOS entry at 0x120 pnpbios: Found PnP BIOS data at 0xc00f7160 pnpbios: Entry = f0000:97be Rev = 1.0 Other BIOS signatures found: ACPI: 00000000 Preloaded elf kernel "kernel" at 0xc0369000. Preloaded userconfig_script "/boot/kernel.conf" at 0xc03690a8. Pentium Pro MTRR support enabled Creating DISK md0 md0: Malloc disk Math emulator present pci_open(1): mode 1 addr port (0x0cf8) is 0x80004108 pci_open(1a): mode1res=0x80000000 (0x80000000) pci_cfgcheck: device 0 [class=060000] [hdr=00] is there (id=700c1022) Using $PIR table, 13 entries at 0xc00fdef0 apm0: on motherboard apm0: found APM BIOS v1.2, connected at v1.2 npx0: on motherboard npx0: INT 16 interface pcib0: on motherboard found-> vendor=0x1022, dev=0x700c, revid=0x11 class=06-00-00, hdrtype=0x00, mfdev=0 subordinatebus=0 secondarybus=0 map[10]: type 1, range 32, base a0000000, size 29 map[14]: type 1, range 32, base 90400000, size 12 found-> vendor=0x1022, dev=0x700d, revid=0x00 class=06-04-00, hdrtype=0x01, mfdev=0 subordinatebus=1 secondarybus=1 found-> vendor=0x1022, dev=0x7440, revid=0x04 class=06-01-00, hdrtype=0x00, mfdev=1 subordinatebus=0 secondarybus=0 found-> vendor=0x1022, dev=0x7441, revid=0x04 class=01-01-8a, hdrtype=0x00, mfdev=0 subordinatebus=0 secondarybus=0 map[20]: type 1, range 32, base 0000f000, size 4 found-> vendor=0x1022, dev=0x7443, revid=0x03 class=06-80-00, hdrtype=0x00, mfdev=0 subordinatebus=0 secondarybus=0 found-> vendor=0x1044, dev=0xa500, revid=0x01 class=06-04-00, hdrtype=0x01, mfdev=1 subordinatebus=2 secondarybus=2 found-> vendor=0x1044, dev=0xa501, revid=0x01 class=0e-00-01, hdrtype=0x00, mfdev=1 subordinatebus=0 secondarybus=0 intpin=a, irq=11 map[10]: type 1, range 32, base c0000000, size 28 found-> vendor=0x1022, dev=0x7448, revid=0x04 class=06-04-00, hdrtype=0x01, mfdev=0 subordinatebus=3 secondarybus=3 pci0: on pcib0 pcib1: at device 1.0 on pci0 found-> vendor=0x1002, dev=0x514c, revid=0x00 class=03-00-00, hdrtype=0x00, mfdev=0 subordinatebus=0 secondarybus=0 intpin=a, irq=10 map[10]: type 1, range 32, base d0000000, size 28 map[14]: type 1, range 32, base 00002000, size 8 map[18]: type 1, range 32, base 90000000, size 16 pci1: on pcib1 pci1: (vendor=0x1002, dev=0x514c) at 5.0 irq 10 isab0: at device 7.0 on pci0 isa0: on isab0 atapci0: port 0xf000-0xf00f at device 7.1 on pci0 ata0: iobase=0x01f0 altiobase=0x03f6 bmaddr=0xf000 ata0: mask=03 ostat0=50 ostat2=00 ata0-master: ATAPI 14 eb ata0-slave: ATAPI 7f 7f ata0: mask=03 stat0=10 stat1=00 ata0: devices=04 ata0: at 0x1f0 irq 14 on atapci0 ata1: iobase=0x0170 altiobase=0x0376 bmaddr=0xf008 ata1: mask=03 ostat0=20 ostat2=30 ata1-master: ATAPI 20 20 ata1-slave: ATAPI 30 30 ata1: mask=03 stat0=20 stat1=30 ata1-master: ATA 25 25 ata1-slave: ATA 25 25 ata1: devices=00 ata1: at 0x170 irq 15 on atapci0 chip1: at device 7.3 on pci0 pcib2: at device 8.0 on pci0 pci2: on pcib2 asr0: mem 0xc0000000-0xcfffffff irq 11 at device 8.1 on pci0 asr0: major=154 asr0: ADAPTEC 3210S FW Rev. 3607, 2 channel, 256 CCBs, Protocol I2O pcib3: at device 16.0 on pci0 found-> vendor=0x10cd, dev=0x1200, revid=0x00 class=01-00-00, hdrtype=0x00, mfdev=0 subordinatebus=0 secondarybus=0 intpin=a, irq=3 map[10]: type 1, range 32, base 00003080, size 6 found-> vendor=0x11fe, dev=0x0005, revid=0x04 class=07-80-00, hdrtype=0x00, mfdev=0 subordinatebus=0 secondarybus=0 intpin=a, irq=10 map[10]: type 1, range 32, base 000030c0, size 6 found-> vendor=0x10b7, dev=0x9200, revid=0x78 class=02-00-00, hdrtype=0x00, mfdev=0 subordinatebus=0 secondarybus=0 intpin=a, irq=5 map[10]: type 1, range 32, base 00003000, size 7 map[14]: type 1, range 32, base 90101000, size 7 pci3: on pcib3 adv0: port 0x3080-0x30bf irq 3 at device 5.0 on pci3 adv0: AdvanSys SCSI Host Adapter, SCSI ID 7, queue depth 20 rp0: port 0x30c0-0x30ff irq 10 at device 6.0 on pci3 RocketPort0 = 8 ports rp0: driver is using old-style compatibility shims xl0: <3Com 3c905C-TX Fast Etherlink XL> port 0x3000-0x307f mem 0x90101000-0x9010107f irq 5 at device 8.0 on pci3 xl0: reset didn't complete xl0: Ethernet address: 00:e0:81:20:3b:c8 xl0: media options word: a xl0: found MII/AUTO miibus0: on xl0 ukphy0: on miibus0 ukphy0: OUI 0x00105a, model 0x0000, rev. 0 ukphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto bpf: xl0 attached ata-: ata0 exists, using next available unit number ata-: ata1 exists, using next available unit number Trying Read_Port at 203 Trying Read_Port at 243 Trying Read_Port at 283 Trying Read_Port at 2c3 Trying Read_Port at 303 Trying Read_Port at 343 Trying Read_Port at 383 Trying Read_Port at 3c3 isa_probe_children: disabling PnP devices isa_probe_children: probing non-PnP devices orm0: