From owner-freebsd-questions Sun Feb 25 20:16:23 2001
Delivered-To: freebsd-questions@freebsd.org
Received: from va.com.au (va.com.au [203.15.106.1])
    by hub.freebsd.org (Postfix) with ESMTP id BC4BA37B491
    for ; Sun, 25 Feb 2001 20:16:11 -0800 (PST)
    (envelope-from jesse@va.com.au)
Received: from [10.0.1.52] (203.164.93.134) by va.com.au
    with ESMTP (Eudora Internet Mail Server 2.2.2); Mon, 26 Feb 2001 14:46:05 +1030
Mime-Version: 1.0
X-Sender: jesse@mail.va.com.au
Message-Id:
Date: Mon, 26 Feb 2001 15:14:59 +1100
To: freebsd-questions@freebsd.org
From: jesse reynolds
Subject: hardware fault or software problem? (machine freezes)
Cc: nigel@4trak.net
Content-Type: text/plain; charset="us-ascii" ; format="flowed"
Sender: owner-freebsd-questions@FreeBSD.ORG
Precedence: bulk
X-Loop: FreeBSD.ORG

Hi folx

Any advice is greatly appreciated here...

I have a server that's about 4 months old. It has started crashing a
fair bit, and when it crashes it freezes completely and instantly,
without giving the OS a chance to log anything, so it doesn't reboot.
The soft-power button on the front does not succeed in turning off the
machine, but the reset button does work, as does ripping the power out
of the back!

This to me sounds like a motherboard issue, but I guess it's possible
it's a software problem.

What I do get in my messages file is a lot of random read errors from
the primary hard drive, which is running on UDMA/66 (Promise controller
onboard a Gigabyte motherboard). Is it possible the hardware or software
gives up after a certain number of read errors? (Each time, the read is
retried and is fine the second time, and it's completely random which
block on the hard disk is going to error out.)

I'm running FreeBSD 4.1.1-RELEASE; a dmesg follows below.

My question is: should I
a) change over the motherboard,
b) upgrade FreeBSD,
c) both a and b, or
d) get completely new hardware?

Thanks!

-jesse

bash-2.04# dmesg | more
Copyright (c) 1992-2000 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
    The Regents of the University of California. All rights reserved.
FreeBSD 4.1.1-RELEASE #0: Tue Sep 26 00:46:59 GMT 2000
    jkh@narf.osd.bsdi.com:/usr/src/sys/compile/GENERIC
Timecounter "i8254" frequency 1193182 Hz
CPU: Pentium III/Pentium III Xeon/Celeron (566.11-MHz 686-class CPU)
  Origin = "GenuineIntel" Id = 0x683 Stepping = 3
  Features=0x383f9ff
real memory = 268369920 (262080K bytes)
avail memory = 257155072 (251128K bytes)
Preloaded elf kernel "kernel" at 0xc0416000.
Pentium Pro MTRR support enabled
md0: Malloc disk
npx0: on motherboard
npx0: INT 16 interface
pcib0: on motherboard
pci0: on pcib0
pcib1: at device 1.0 on pci0
pci1: on pcib1
pci1: at 0.0
isab0: at device 7.0 on pci0
isa0: on isab0
atapci0: port 0xf000-0xf00f at device 7.1 on pci0
ata0: at 0x1f0 irq 14 on atapci0
ata1: at 0x170 irq 15 on atapci0
uhci0: port 0xa000-0xa01f irq 11 at device 7.2 on pci0
usb0: on uhci0
usb0: USB revision 1.0
uhub0: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub0: 2 ports with 2 removable, self powered
uhub1: vendor 0x0000 product 0x0e01, class 9/0, rev 1.10/0.04, addr 2
uhub1: 4 ports with 4 removable, self powered
chip1: port 0x5000-0x500f at device 7.3 on pci0
ahc0: port 0xa400-0xa4ff mem 0xe8222000-0xe8222fff irq 10 at device 9.0 on pci0
aic7860: Single Channel A, SCSI Id=7, 3/255 SCBs
fxp0: port 0xa800-0xa81f mem 0xe8000000-0xe80fffff,0xe8220000-0xe8220fff irq 5 at device 12.0 on pci0
fxp0: Ethernet address 00:a0:c9:c9:13:83
fxp1: port 0xac00-0xac3f mem 0xe8100000-0xe81fffff,0xe8221000-0xe8221fff irq 10 at device 13.0 on pci0
fxp1: Ethernet address 00:d0:b7:3b:3c:d2
atapci1: port 0xc000-0xc03f,0xbc00-0xbc03,0xb800-0xb807,0xb400-0xb403,0xb000-0xb007 mem 0xe8200000-0xe821ffff irq 9 at device 14.0 on pci0
ata2: at 0xb000 on atapci1
ata3: at 0xb800 on atapci1
fdc0: at port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on isa0
fdc0: FIFO enabled, 8 bytes threshold
fd0: <1440-KB 3.5" drive> on fdc0 drive 0
atkbdc0: at port 0x60,0x64 on isa0
atkbd0: flags 0x1 irq 1 on atkbdc0
kbd0 at atkbd0
vga0: at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
sc0: at flags 0x100 on isa0
sc0: VGA <16 virtual consoles, flags=0x300>
sio0 at port 0x3f8-0x3ff irq 4 flags 0x10 on isa0
sio0: type 16550A
sio1 at port 0x2f8-0x2ff irq 3 on isa0
sio1: type 16550A
ppc0: at port 0x378-0x37f irq 7 on isa0
ppc0: Generic chipset (NIBBLE-only) in COMPATIBLE mode
plip0: on ppbus0
lpt0: on ppbus0
lpt0: Interrupt-driven port
ppi0: on ppbus0
ad4: 19623MB [39870/16/63] at ata2-master using UDMA66
ad6: 19623MB [39870/16/63] at ata3-master using UDMA66
acd0: CDROM at ata0-master using PIO4
Waiting 15 seconds for SCSI devices to settle
Mounting root from ufs:/dev/ad4s1a
WARNING: / was not properly dismounted
ad4: UDMA ICRC READ ERROR blk# 132928 retrying
ad4: UDMA ICRC READ ERROR blk# 6816704 retrying
ad4: UDMA ICRC READ ERROR blk# 7799856 retrying
ad4: UDMA ICRC READ ERROR blk# 9504576 retrying
ad4: UDMA ICRC READ ERROR blk# 14747456 retrying
ad4: UDMA ICRC READ ERROR blk# 19595328 retrying
ad4: UDMA ICRC READ ERROR blk# 20382320 retrying
ad4: UDMA ICRC READ ERROR blk# 20709552 retrying
ad4: UDMA ICRC READ ERROR blk# 22217216 retrying
ad4: UDMA ICRC READ ERROR blk# 22218000 retrying
ad4: UDMA ICRC READ ERROR blk# 26740320 retrying
ad4: UDMA ICRC READ ERROR blk# 29426176 retrying
ad4: UDMA ICRC READ ERROR blk# 30212160 retrying
ad4: UDMA ICRC READ ERROR blk# 34670400 retrying
ad4: UDMA ICRC READ ERROR blk# 36045760 retrying
ad4: UDMA ICRC READ ERROR blk# 39452960 retrying
ad4: UDMA ICRC READ ERROR blk# 4456512 retrying
ad4: UDMA ICRC READ ERROR blk# 4522944 retrying
ad4: UDMA ICRC READ ERROR blk# 8531344 retrying
ad4: UDMA ICRC READ ERROR blk# 7363056 retrying
bash-2.04#

-- 
Jesse Reynolds - Virtual Artists Pty Ltd - http://www.va.com.au
Email: jesse (at) va.com.au      > Web Hosting
Phone: +61 8 8223 2288           > Streaming Media Services
?: http://jesse.va.com.au        > Telehousing / Colocation
                                 > Internet Systems Consulting
                                 > Internet Application Design
http://www.google.com/search?q=site:va.com.au+fun

To Unsubscribe: send mail to majordomo@FreeBSD.org
with "unsubscribe freebsd-questions" in the body of the message