Date:      Sun, 10 Nov 2002 22:46:30 -0400 (AST)
From:      "Marc G. Fournier" <scrappy@hub.org>
To:        freebsd-stable@freebsd.org
Subject:   Changes on Oct 28th break -STABLE kernel ...
Message-ID:  <20021110222143.D11716-100000@hub.org>


Evening ...

  After spending today doing an incremental upgrade, starting from Oct12th
(my last stable), going to Oct12th, Oct19 and Oct27th (all successful), I
dropped down to a 'daily incremental' ... Oct28th (the kernel I am now
running) appears to work great, but as soon as I go to Oct29th, I can't
get it to come up ...
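
  For anyone wanting to repeat this kind of narrowing, the date-by-date
search can be sketched as a bisection. This is only a sketch: boots_ok is a
hypothetical callback standing in for "check out the sources as of that
date, build the kernel, and see whether the box comes up", and is not part
of cvsup or the build system.

```python
from datetime import date, timedelta


def first_bad_date(good, bad, boots_ok):
    """Return the earliest source date whose kernel fails to boot.

    Assumes `good` is a date known to boot, `bad` is a later date known
    to fail, and `boots_ok(d)` (hypothetical) reports whether a kernel
    built from sources as of date `d` comes up.
    """
    while (bad - good).days > 1:
        # Try the date halfway between the last good and first bad.
        mid = good + timedelta(days=(bad - good).days // 2)
        if boots_ok(mid):
            good = mid
        else:
            bad = mid
    return bad
```

  With Oct12th known good and today's sources known bad, this needs about
five build-and-boot cycles instead of one per day.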

  From cvsup, the changes between Oct28th (midnight) and Oct29th are:

Connected to cvsup.FreeBSD.org
Updating collection src-all/cvs
 Edit src/etc/MAKEDEV
 Edit src/sys/conf/files
 Edit src/sys/dev/amr/amr.c
 Edit src/sys/dev/amr/amr_cam.c
 Edit src/sys/dev/amr/amr_compat.h
 Edit src/sys/dev/amr/amr_disk.c
 Edit src/sys/dev/amr/amr_pci.c
 Edit src/sys/dev/amr/amr_tables.h
 Edit src/sys/dev/amr/amrio.h
 Edit src/sys/dev/amr/amrreg.h
 Edit src/sys/dev/amr/amrvar.h
 Edit src/sys/modules/amr/Makefile
 Edit src/sys/netinet/ip_fw.c
Finished successfully

  The AMR driver is what I'm running:

venus# grep amr /var/run/dmesg.boot
amr0: flushing cache...done
amr0: <AMI MegaRAID> mem 0xfc1f0000-0xfc1fffff irq 11 at device 2.0 on pci1
amr0: <Series 475 40 Logical Drive Firmware> Firmware E161, BIOS 3.13, 32MB RAM
amrd0: <MegaRAID logical drive> on amr0
amrd0: 105000MB (215040000 sectors) RAID 5 (optimal)
Mounting root from ufs:/dev/amrd0s1a

  I've made no changes to my kernel config (or the hardware) between the
Oct28 and Oct29th kernels ...

  This server is a remote server, so 'hands on' debugging is difficult ...
the folks at Rackspace try and provide what they can, based on what is on
the console, which, in this case, is:

"Your kernal failed when tring to boot the second processor."

  So, it seems that the 'hang @ "SMP: AP CPU #1 Launched!"' problem
started with code around the Oct28/Oct29th period ...

  According to /var/run/dmesg.boot, the following is what happens after
the "SMP: AP CPU #1 Launched!" message comes up on a good boot:

SMP: AP CPU #1 Launched!
sa0 at sym0 bus 0 target 0 lun 0
sa0: <SONY SDX-700C 0101> Removable Sequential Access SCSI-2 device
sa0: 80.000MB/s transfers (40.000MHz, offset 31, 16bit)
Mounting root from ufs:/dev/amrd0s1a

  and, on a good boot, the dmesg.boot looks like:

Copyright (c) 1992-2002 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
	The Regents of the University of California. All rights reserved.
FreeBSD 4.7-STABLE #20: Sun Nov 10 18:55:29 CST 2002
    root@venus.hub.org:/usr/obj/usr/src/sys/kernel
Timecounter "i8254"  frequency 1193182 Hz
CPU: Pentium III/Pentium III Xeon/Celeron (1262.67-MHz 686-class CPU)
  Origin = "GenuineIntel"  Id = 0x6b1  Stepping = 1
  Features=0x383fbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,MMX,FXSR,SSE>
real memory  = 4227858432 (4128768K bytes)
avail memory = 4120436736 (4023864K bytes)
Programming 16 pins in IOAPIC #0
IOAPIC #0 intpin 2 -> irq 0
Programming 16 pins in IOAPIC #1
FreeBSD/SMP: Multiprocessor motherboard
 cpu0 (BSP): apic id:  0, version: 0x00040011, at 0xfee00000
 cpu1 (AP):  apic id:  1, version: 0x00040011, at 0xfee00000
 io0 (APIC): apic id:  4, version: 0x000f0011, at 0xfec00000
 io1 (APIC): apic id:  5, version: 0x000f0011, at 0xfec01000
Preloaded elf kernel "kernel" at 0xc02a7000.
Pentium Pro MTRR support enabled
Using $PIR table, 10 entries at 0xc00f51c0
npx0: <math processor> on motherboard
npx0: INT 16 interface
pcib0: <ServerWorks NB6635 3.0LE host to PCI bridge> on motherboard
IOAPIC #1 intpin 6 -> irq 2
IOAPIC #1 intpin 4 -> irq 5
IOAPIC #1 intpin 5 -> irq 9
pci0: <PCI bus> on pcib0
pci0: <ATI Mach64-GR graphics accelerator> at 1.0 irq 2
pci0: <unknown card> (vendor=0x8086, dev=0x1229) at 4.0 irq 5
pci0: <unknown card> (vendor=0x8086, dev=0x1229) at 5.0 irq 9
isab0: <ServerWorks IB6566 PCI to ISA bridge> at device 15.0 on pci0
isa0: <ISA bus> on isab0
pci0: <Unknown PCI ATA controller> at 15.1
pci0: <OHCI USB controller> at 15.2 irq 10
pcib1: <ServerWorks NB6635 3.0LE host to PCI bridge> on motherboard
IOAPIC #1 intpin 11 -> irq 11
IOAPIC #1 intpin 7 -> irq 16
IOAPIC #1 intpin 8 -> irq 17
pci1: <PCI bus> on pcib1
amr0: <AMI MegaRAID> mem 0xfc1f0000-0xfc1fffff irq 11 at device 2.0 on pci1
amr0: <Series 475 40 Logical Drive Firmware> Firmware E161, BIOS 3.13, 32MB RAM
sym0: <896> port 0xe400-0xe4ff mem 0xfebc8000-0xfebc9fff,0xfebe0000-0xfebe03ff irq 16 at device 3.0 on pci1
sym0: Symbios NVRAM, ID 7, Fast-40, SE, parity checking
sym0: open drain IRQ line driver, using on-chip SRAM
sym0: using LOAD/STORE-based firmware.
sym0: handling phase mismatch from SCRIPTS.
sym1: <896> port 0xe800-0xe8ff mem 0xfebe8000-0xfebe9fff,0xfebf0000-0xfebf03ff irq 17 at device 3.1 on pci1
sym1: Symbios NVRAM, ID 7, Fast-40, LVD, parity checking
sym1: open drain IRQ line driver, using on-chip SRAM
sym1: using LOAD/STORE-based firmware.
sym1: handling phase mismatch from SCRIPTS.
orm0: <Option ROMs> at iomem 0xc0000-0xc7fff,0xc9800-0xca7ff,0xca800-0xcb7ff,0xcb800-0xcbfff on isa0
atkbdc0: <Keyboard controller (i8042)> at port 0x60,0x64 on isa0
atkbd0: <AT Keyboard> flags 0x1 irq 1 on atkbdc0
vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
sc0: <System console> at flags 0x100 on isa0
sc0: VGA <16 virtual consoles, flags=0x300>
APIC_IO: Testing 8254 interrupt delivery
APIC_IO: Broken MP table detected: 8254 is not connected to IOAPIC #0 intpin 2
APIC_IO: routing 8254 via 8259 and IOAPIC #0 intpin 0
IP packet filtering initialized, divert disabled, rule-based forwarding enabled, default to accept, logging disabled
Waiting 15 seconds for SCSI devices to settle
(noperiph:sym0:0:-1:-1): SCSI BUS reset delivered.
(noperiph:sym1:0:-1:-1): SCSI BUS reset delivered.
amrd0: <MegaRAID logical drive> on amr0
amrd0: 105000MB (215040000 sectors) RAID 5 (optimal)
SMP: AP CPU #1 Launched!
sa0 at sym0 bus 0 target 0 lun 0
sa0: <SONY SDX-700C 0101> Removable Sequential Access SCSI-2 device
sa0: 80.000MB/s transfers (40.000MHz, offset 31, 16bit)
Mounting root from ufs:/dev/amrd0s1a
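
  The lines that follow the "SMP: AP CPU #1 Launched!" message can be pulled
out of a saved copy of dmesg.boot with something like the following. It is
a minimal sketch; lines_after is a helper name made up for illustration.

```python
def lines_after(marker, text):
    """Return the lines of `text` that follow the first line containing `marker`.

    Returns an empty list if the marker never appears.
    """
    lines = text.splitlines()
    for i, line in enumerate(lines):
        if marker in line:
            return lines[i + 1:]
    return []


# Example use against the boot log on the running system:
#   with open("/var/run/dmesg.boot") as f:
#       for line in lines_after("SMP: AP CPU #1 Launched!", f.read()):
#           print(line)
```

  Comparing that tail from a good boot against whatever is left on the
console of a failed boot shows which step the kernel never reached.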

  The motherboard is: Thunder Le-T with bios version 1.06 ...

  Thoughts?  anything else I can provide to narrow down, and fix, the
problem?

thanks ...


