From owner-freebsd-stable Wed Apr 4 22:27:10 2001 Delivered-To: freebsd-stable@freebsd.org Received: from mail4.lig.bellsouth.net (mail4.lig.bellsouth.net [207.203.120.43]) by hub.freebsd.org (Postfix) with ESMTP id 5681437B496 for ; Wed, 4 Apr 2001 22:27:03 -0700 (PDT) (envelope-from fury@unixforever.net) Received: from damaged (adsl-78-132-242.btr.bellsouth.net [216.78.132.242]) by mail4.lig.bellsouth.net (3.3.5alt/0.75.2) with SMTP id BAA11329; Thu, 5 Apr 2001 01:27:01 -0400 (EDT) Message-ID: <002801c0bd91$517c92c0$0204a8c0@unixforever.net> From: "fury" To: "anarcat" , References: <20010404231722.A25165@dojo> Subject: Re: Spontaneous reboot, no panic. Recovering from a crash. Date: Thu, 5 Apr 2001 00:28:48 -0500 MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit X-Priority: 3 X-MSMail-Priority: Normal X-Mailer: Microsoft Outlook Express 5.00.2615.200 X-MimeOLE: Produced By Microsoft MimeOLE V5.00.2615.200 Sender: owner-freebsd-stable@FreeBSD.ORG Precedence: bulk X-Loop: FreeBSD.ORG First of all sorry about the quote, I'm in my makeshift emailer at the moment. Second, in response to your email, yes I have often received the same response. I have managed to fix it for now basically on blind luck. No it was NOT a hardware issue unlike many ppl stated, fact is, I have never figured it out. There was one machine effects, where not the same as the other *identical* machine. CVSup seemed to fix the spuratic one, but with the only machine differences being two network cards of the same brand and model, and in a series of serial numbers no less. First my suspect was the adaptech 39160, but no, I swapped the good with the bad and no luck. Then I tried a different video card, nope, different network cards, nope, memory nope. I essentially went through every peice of hardware in the machine including the motherboard, and power supply but not the seagate cheetah. Go figure, so I gave up on the list, but I will not do unto others that sound like the same situation as it was done unto me at that time. Even if this is insignifigant, at least you have a response, which is more than *I* got. But what I think it all came down to, was probably a bad CVSup tree that finally I just caught a good tree and after that all was well again. Maybe you have some enlightenment from this, maybe not. I sure hope so, at least *I* respond if I think its even close to relevant. -fury ----- Original Message ----- From: anarcat To: Sent: Wednesday, April 04, 2001 11:17 PM Subject: Spontaneous reboot, no panic. Recovering from a crash. Hi. I need to share the experience I just had with a very particular crash. If any of you had similar experiences, please share. Critics and suggestions on procedure are welcome. I really don't know how to put this, but since we're in a code freeze and I'm running stable, if a few people witness this behavior, well... Anyways... A bit of background: FreeBSD shall.anarcat.yi.org 4.3-BETA FreeBSD 4.3-BETA #2: Wed Mar 14 18:50:48 EST 2001 root@:/usr/obj/usr/src/sys/SHALL i386 A few minutes ago, I was doing a few things at the same time on my machine. I was doing a make install clean in sysutils/[gcombust|gtoaster|xcdroast], copying /mnt/cdr to ~/cdr, and opening a ssh session to my shell account. Before I could get to a password prompt on the shell account, I didn't even had time to realize what's going on that my computer beeped and was back to the BIOS POST startup procedure. Bad. I was a bit distracted at the time it happened so I have no idea what happened exactly. When FreeBSd restarted, fsck wasn't able to fix everything automagically, so I had to run it from single-user mode. I had so much scrapped files that I interrupted the process at some point to restart it with "fsck -y". Things like UNREF DIR, LINK COUNT DIR, UNREF FILE, and all sort of evil warnings of the kind. Now my system seems to be running fine, but I did not run extensive tests. I do not have tripwire or any filesystem integrity checker and I have no clue on how I can figure out what was scrapped. I am unable to provide debug info as there was no panic, as far as I can tell. I can say, however, that there are 348 files in /usr/lost+found. Yay. All of which have file mod dates set before the crash, this morning, actually. But I have the following info: The "crash" has somehow been recorded in wtmp. last says: anarcat ttyp1 :0 Mer 4 avr 23:45 - 23:47 (00:01) reboot ~ Mer 4 avr 23:43 anarcat ttyp4 :1 Mer 4 avr 23:26 - crash (00:17) Oddly enough, dmesg was not record in /var/log/messages after bootup. Last thing I have there is: Apr 4 23:23:47 shall su: anarcat to root on /dev/ttyp2 Apr 4 23:24:47 shall su: anarcat to root on /dev/ttyp3 Apr 4 23:43:07 shall ntpd[223]: ntpd 4.0.99b Fri Mar 9 00:52:31 EST 2001 (1) Apr 4 23:43:07 shall ntpd[223]: using kernel phase-lock loop 2040 Where there should have been dmesg info somewhere in between. The 2 su's are my make install attempts before the crash. I really don't see what other info I could find on that. This morning's dmesg output follows. My next step is to remake to world after a fresh cvsup, because from the stuff in lost+found, I lost a few files in /usr/src. I read a few 'spontaneous reboots' threads before and I guess this comes to the same thing. Forgive and forget. "Probably a hardware problem". :) Anyways, if I have any fresh info on that, I'll keep the list informed, but as of now, just consider this mail as a friendly marker of a local apocalypse. Don't do as daddy does, boys and girls. Do backups. Often. You never know when the shit'll the fan. :) Dmesg output. Please note that the cd used for the copy was the scsi one. Copyright (c) 1992-2001 The FreeBSD Project. Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 The Regents of the University of California. All rights reserved. FreeBSD 4.3-BETA #2: Wed Mar 14 18:50:48 EST 2001 root@:/usr/obj/usr/src/sys/SHALL Timecounter "i8254" frequency 1193182 Hz CPU: Pentium/P55C (167.05-MHz 586-class CPU) Origin = "GenuineIntel" Id = 0x543 Stepping = 3 Features=0x8001bf real memory = 33554432 (32768K bytes) config> dis sio2 config> q avail memory = 29335552 (28648K bytes) Preloaded elf kernel "kernel" at 0xc036a000. Preloaded userconfig_script "/boot/kernel.conf" at 0xc036a09c. Intel Pentium detected, installing workaround for F00F bug md0: Malloc disk npx0: on motherboard npx0: INT 16 interface pcib0: on motherboard pci0: on pcib0 isab0: at device 7.0 on pci0 isa0: on isab0 atapci0: port 0xf000-0xf00f at device 7.1 on pci0 ata0: at 0x1f0 irq 14 on atapci0 pci0: at 7.2 irq 11 chip1: port 0x5000-0x500f at device 7.3 on pci0 pci0: at 9.0 ahc0: port 0x6400-0x64ff mem 0xe1001000-0xe1001fff irq 10 at device 10.0 on pci0 aic7860: Single Channel A, SCSI Id=7, 3/255 SCBs vr0: port 0x6a00-0x6aff mem 0xe1002000-0xe10020ff irq 9 at device 11.0 on pci0 vr0: Ethernet address: 00:50:ba:64:e3:6a miibus0: on vr0 ukphy0: on miibus0 ukphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto atkbdc0: at port 0x60,0x64 on isa0 atkbd0: flags 0x1 irq 1 on atkbdc0 kbd0 at atkbd0 vga0: at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0 sc0: at flags 0x100 on isa0 sc0: VGA <16 virtual consoles, flags=0x300> fdc0: at port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on isa0 fdc0: FIFO enabled, 8 bytes threshold fd0: <1440-KB 3.5" drive> on fdc0 drive 0 sio0 at port 0x3f8-0x3ff irq 4 flags 0x10 on isa0 sio0: type 16550A sio1 at port 0x2f8-0x2ff irq 3 on isa0 sio1: type 16550A ppc0: at port 0x378-0x37f irq 7 on isa0 ppc0: Generic chipset (EPP/NIBBLE) in COMPATIBLE mode ppi0: on ppbus0 plip0: on ppbus0 lpt0: on ppbus0 lpt0: Interrupt-driven port pca0 at port 0x40 on isa0 joy0 at port 0x201 on isa0 sbc0: at port 0x220-0x22f,0x330-0x331,0x388-0x38b irq 5 drq 1,5 on isa0 pcm0: on sbc0 unknown: can't assign resources ata0-master: DMA limited to UDMA33, non-ATA66 compliant cable ad0: 29311MB [59554/16/63] at ata0-master UDMA33 acd0: CDROM at ata0-slave using PIO3 Waiting 2 seconds for SCSI devices to settle Mounting root from ufs:/dev/ad0s2a cd0 at ahc0 bus 0 target 3 lun 0 cd0: Removable CD-ROM SCSI-2 device cd0: 20.000MB/s transfers (20.000MHz, offset 15) cd0: cd present [279683 x 2048 byte records] The Anarcat. To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-stable" in the body of the message