From owner-freebsd-hardware@FreeBSD.ORG Wed Nov 11 18:05:06 2009 Return-Path: Delivered-To: hardware@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 65FF3106568F for ; Wed, 11 Nov 2009 18:05:06 +0000 (UTC) (envelope-from david@catwhisker.org) Received: from albert.catwhisker.org (adsl-63-193-123-122.dsl.snfc21.pacbell.net [63.193.123.122]) by mx1.freebsd.org (Postfix) with ESMTP id 04A688FC21 for ; Wed, 11 Nov 2009 18:05:05 +0000 (UTC) Received: from albert.catwhisker.org (localhost [127.0.0.1]) by albert.catwhisker.org (8.14.3/8.14.3) with ESMTP id nABHbmxI001853 for ; Wed, 11 Nov 2009 09:37:48 -0800 (PST) (envelope-from david@albert.catwhisker.org) Received: (from david@localhost) by albert.catwhisker.org (8.14.3/8.14.3/Submit) id nABHbmSq001852 for hardware@freebsd.org; Wed, 11 Nov 2009 09:37:48 -0800 (PST) (envelope-from david) Date: Wed, 11 Nov 2009 09:37:47 -0800 From: David Wolfskill To: hardware@freebsd.org Message-ID: <20091111173747.GA1150@albert.catwhisker.org> Mime-Version: 1.0 Content-Type: multipart/signed; micalg=pgp-sha1; protocol="application/pgp-signature"; boundary="ADZbWkCsHQ7r3kzd" Content-Disposition: inline User-Agent: Mutt/1.4.2.3i Cc: Subject: 7.2-STABLE i386 box crashing -- clues? X-BeenThere: freebsd-hardware@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list Reply-To: hardware@freebsd.org, David Wolfskill List-Id: General discussion of FreeBSD hardware List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 11 Nov 2009 18:05:06 -0000 --ADZbWkCsHQ7r3kzd Content-Type: multipart/mixed; boundary="Kj7319i9nmIyA2yE" Content-Disposition: inline --Kj7319i9nmIyA2yE Content-Type: text/plain; charset=us-ascii Content-Disposition: inline Content-Transfer-Encoding: quoted-printable Sometimes, I find hardware Seriously Annoying. I'm getting pretty close to that in the present case. 
I have a pretty normal machine; it's an Intel D865GBF desktop system board in a rack-mount 2U chassis; a single 2.6 GHz CPU where I've (presently) disabled hyperthreading (in an effort to un-complexify things).

I've enabled KDB. I've gone into the BIOS to check the hardware "events log". I've set up serial console, and am running tip(1) to it from within script(1) from another machine.

Nothing. No clues.

Every once in a while, it just crashes -- hard. It loses video output at that point; Ctl+Alt+Esc doesn't appear to change anything; entering (say) "reset" blindly at that point has no apparent effect. Either a reset switch or a power cycle brings the machine back up again ... for a while.

Someone suggested that memory might be at fault; it had two 512 MB PC3200 DIMMs. Local place advertised having compatible Kingston memory of those specs for $10/DIMM; when I got there, they said that it was out of stock and they weren't getting any more, but they did have 1GB PC3200 DIMMs. So I went ahead & got them (though it was rather more than $20), and the next time it crashed, I swapped the memory.

This morning when I got up, it had crashed again.

I recalled that I had at one point been hoping to run backups (to tape) from the machine, and accordingly, had attached a SCSI host adaptor via PCI riser card. Since I had nothing actually connected to the card, I pulled it out of the machine before bringing it back up. (I also felt around for excessively warm spots; nothing. All fans spin up, as well.)

Well, it just crashed again.

Flaky CPU? Flaky power supply? How might I tell?

Attached dmesg.boot is from most recent boot, so it won't show the SCSI card I pulled.

Please include me on replies, as I'm not subscribed to hardware@. I've set Reply-To for convenience.

Thanks...

Peace,
david
--
David H.
Wolfskill david@catwhisker.org Depriving a girl or boy of an opportunity for education is evil. See http://www.catwhisker.org/~david/publickey.gpg for my public key. --Kj7319i9nmIyA2yE Content-Type: text/plain; charset=us-ascii Content-Disposition: attachment; filename="dmesg.boot" Copyright (c) 1992-2009 The FreeBSD Project. Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 The Regents of the University of California. All rights reserved. FreeBSD is a registered trademark of The FreeBSD Foundation. FreeBSD 7.2-STABLE #173 r198984: Fri Nov 6 13:53:16 PST 2009 root@freebeast.catwhisker.org:/common/S3/obj/usr/src/sys/ALBERT i386 Timecounter "i8254" frequency 1193182 Hz quality 0 CPU: Intel(R) Pentium(R) 4 CPU 2.60GHz (2593.68-MHz 686-class CPU) Origin = "GenuineIntel" Id = 0xf29 Stepping = 9 Features=0xbfebfbff Features2=0x4400 Logical CPUs per core: 2 real memory = 2145579008 (2046 MB) avail memory = 2089758720 (1992 MB) ACPI APIC Table: ioapic0 irqs 0-23 on motherboard kbd1 at kbdmux0 acpi0: on motherboard acpi0: [ITHREAD] acpi0: Power Button (fixed) acpi0: reservation of 0, a0000 (3) failed acpi0: reservation of 100000, 7fe00000 (3) failed Timecounter "ACPI-fast" frequency 3579545 Hz quality 1000 acpi_timer0: <24-bit timer at 3.579545MHz> port 0x408-0x40b on acpi0 pcib0: port 0xcf8-0xcff on acpi0 pci0: on pcib0 vgapci0: port 0xec00-0xec07 mem 0xf0000000-0xf7ffffff,0xffa80000-0xffafffff irq 16 at device 2.0 on pci0 agp0: on vgapci0 agp0: detected 892k stolen memory agp0: aperture size is 128M uhci0: port 0xdc00-0xdc1f irq 16 at device 29.0 on pci0 uhci0: [GIANT-LOCKED] uhci0: [ITHREAD] usb0: on uhci0 usb0: USB revision 1.0 uhub0: on usb0 uhub0: 2 ports with 2 removable, self powered uhci1: port 0xe000-0xe01f irq 19 at device 29.1 on pci0 uhci1: [GIANT-LOCKED] uhci1: [ITHREAD] usb1: on uhci1 usb1: USB revision 1.0 uhub1: on usb1 uhub1: 2 ports with 2 removable, self powered uhci2: port 0xe400-0xe41f irq 18 at device 29.2 on pci0 uhci2: 
[GIANT-LOCKED] uhci2: [ITHREAD] usb2: on uhci2 usb2: USB revision 1.0 uhub2: on usb2 uhub2: 2 ports with 2 removable, self powered uhci3: port 0xe800-0xe81f irq 16 at device 29.3 on pci0 uhci3: [GIANT-LOCKED] uhci3: [ITHREAD] usb3: on uhci3 usb3: USB revision 1.0 uhub3: on usb3 uhub3: 2 ports with 2 removable, self powered ehci0: mem 0xffa7fc00-0xffa7ffff irq 23 at device 29.7 on pci0 ehci0: [GIANT-LOCKED] ehci0: [ITHREAD] usb4: EHCI version 1.0 usb4: companion controllers, 2 ports each: usb0 usb1 usb2 usb3 usb4: on ehci0 usb4: USB revision 2.0 uhub4: on usb4 uhub4: 8 ports with 8 removable, self powered pcib1: at device 30.0 on pci0 pci1: on pcib1 fxp0: port 0xcc00-0xcc3f mem 0xff8ff000-0xff8fffff irq 20 at device 8.0 on pci1 miibus0: on fxp0 inphy0: PHY 1 on miibus0 inphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto fxp0: Ethernet address: 00:0c:f1:8f:fd:69 fxp0: [ITHREAD] isab0: at device 31.0 on pci0 isa0: on isab0 atapci0: port 0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0xffa0-0xffaf at device 31.1 on pci0 ata0: on atapci0 ata0: [ITHREAD] ata1: on atapci0 ata1: [ITHREAD] ichsmb0: port 0xd800-0xd81f irq 17 at device 31.3 on pci0 ichsmb0: [GIANT-LOCKED] ichsmb0: [ITHREAD] smbus0: on ichsmb0 smb0: on smbus0 pcm0: mem 0xffa7f800-0xffa7f9ff,0xffa7f400-0xffa7f4ff irq 17 at device 31.5 on pci0 pcm0: [ITHREAD] pcm0: primary codec not ready! 
pcm0: acpi_button0: on acpi0 atkbdc0: port 0x60,0x64 irq 1 on acpi0 atkbd0: irq 1 on atkbdc0 kbd0 at atkbd0 atkbd0: [GIANT-LOCKED] atkbd0: [ITHREAD] psm0: irq 12 on atkbdc0 psm0: [GIANT-LOCKED] psm0: [ITHREAD] psm0: model Generic PS/2 mouse, device ID 0 fdc0: port 0x3f0-0x3f1,0x3f2-0x3f3,0x3f4-0x3f5,0x3f7 irq 6 drq 2 on acpi0 fdc0: [FILTER] fd0: <1440-KB 3.5" drive> on fdc0 drive 0 sio0: configured irq 4 not in bitmap of probed irqs 0 sio0: port may not be enabled sio0: configured irq 4 not in bitmap of probed irqs 0 sio0: port may not be enabled sio0: <16550A-compatible COM port> port 0x3f8-0x3ff irq 4 flags 0x10 on acpi0 sio0: type 16550A, console sio0: [FILTER] cpu0: on acpi0 p4tcc0: on cpu0 pmtimer0 on isa0 orm0: at iomem 0xc0000-0xc9fff pnpid ORM0000 on isa0 ppc0: at port 0x378-0x37f irq 7 on isa0 ppc0: Generic chipset (NIBBLE-only) in COMPATIBLE mode ppbus0: on ppc0 ppbus0: [ITHREAD] plip0: on ppbus0 plip0: WARNING: using obsoleted IFF_NEEDSGIANT flag lpt0: on ppbus0 lpt0: Interrupt-driven port ppi0: on ppbus0 ppc0: [GIANT-LOCKED] ppc0: [ITHREAD] sc0: at flags 0x100 on isa0 sc0: VGA <16 virtual consoles, flags=0x300> sio1: configured irq 3 not in bitmap of probed irqs 0 sio1: port may not be enabled vga0: at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0 Timecounter "TSC" frequency 2593681984 Hz quality 800 Timecounters tick every 1.000 msec ad0: 152627MB at ata0-master UDMA100 ad1: 238475MB at ata0-slave UDMA100 acd0: CDROM at ata1-master UDMA33 Trying to mount root from ufs:/dev/ad0s2a WARNING: / was not properly dismounted Limiting icmp unreach response from 235 to 200 packets/sec --Kj7319i9nmIyA2yE-- --ADZbWkCsHQ7r3kzd Content-Type: application/pgp-signature Content-Disposition: inline -----BEGIN PGP SIGNATURE----- Version: GnuPG v2.0.13 (FreeBSD) iEYEARECAAYFAkr69moACgkQmprOCmdXAD1LgwCdE5dWURRR67kvcSehS2PxOakc +yMAn0hlMpEl4jg0t/hiuBoFEEAGHLAo =fhFm -----END PGP SIGNATURE----- --ADZbWkCsHQ7r3kzd--