From owner-freebsd-questions@FreeBSD.ORG Sun Oct 7 11:55:05 2007 Return-Path: Delivered-To: freebsd-questions@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id E26EC16A418 for ; Sun, 7 Oct 2007 11:55:04 +0000 (UTC) (envelope-from gpeel@thenetnow.com) Received: from thenetnow.com (thenetnow.com [69.90.69.141]) by mx1.freebsd.org (Postfix) with ESMTP id B78E413C45D for ; Sun, 7 Oct 2007 11:55:04 +0000 (UTC) (envelope-from gpeel@thenetnow.com) Received: from hpeel.ody.ca ([216.240.12.2] helo=GRANT) by constellation.thenetnow.com with esmtpa (Exim 4.63 (FreeBSD)) (envelope-from ) id 1IeUin-000FZM-NF; Sun, 07 Oct 2007 07:54:57 -0400 Message-ID: <008201c808d8$dfd697c0$6501a8c0@GRANT> From: "Grant Peel" To: "Gary Kline" , "Garrett Cooper" References: <009c01c80810$169e4830$6501a8c0@GRANT> <4707A770.9060804@u.washington.edu> <20071007021558.GB67456@thought.org> Date: Sun, 7 Oct 2007 07:54:53 -0400 Organization: The Net Now MIME-Version: 1.0 X-Priority: 3 X-MSMail-Priority: Normal X-Mailer: Microsoft Outlook Express 6.00.2900.3138 X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2900.3138 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: quoted-printable X-Content-Filtered-By: Mailman/MimeDel 2.1.5 Cc: FreeBSD Mailing List Subject: Re: Server Reboot X-BeenThere: freebsd-questions@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list Reply-To: Grant Peel List-Id: User questions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 07 Oct 2007 11:55:05 -0000 ----- Original Message -----=20 From: Gary Kline=20 To: Garrett Cooper=20 Cc: Grant Peel ; FreeBSD Mailing List=20 Sent: Saturday, October 06, 2007 10:15 PM Subject: Re: Server Reboot On Sat, Oct 06, 2007 at 08:19:12AM -0700, Garrett Cooper wrote: > Grant Peel wrote: > >Hi all, > > > >This is the first time in 10 years I have seen this. > > > >I have a Dell PE750 (vintage 2004), running FreeBSD 6.2 that had = been=20 > >up and running for about 30 days without any issues. > > > >The server somehow rebooted last night, apparently, all by itself. > > > >The last log file line I can find waqs about 12:30 AM. The dmesg = shows=20 > >it restarted about 1:12 AM. dmesg shows some file errors that were=20 > >fixed upon reboot, other that that, everything is back up and = running=20 > >normally. > > > >I was wondering if anyone has seen anything similar and if a cause = was=20 > >found. > > > >Here is what I know: > > > >-all servers (there are 5 more) are plugged into the same power bar = > >and none of the others were affected > >-none of the standard logs show any intrusion or root log in = attempt, > >-dmesg and console log show nothing of note, > >-the DRAC logs and ESM logs show nothing, > >-the sensors (temp,voltage,etc) logs currently show no issues, all=20 > >well withing normal parms. > >-my MRTG logs show no abnormal CPU usage or network activity. > > > > > >Any help would be appreciated, > > > >-Grant >=20 > Check the capacitors on the motherboard (in particular near the=20 > memory and processor); they may be going bad (esp with that vintage. = > 2004 Dell was a bad year =3DP..). > You'll be looking for swelled capacitors and possibly some orange=20 > dialectric being emitted. > -Garrett Strange. In just the past few, 2 or 3 or even 4 weeks my=20 Dell-8200 has spontaneouslyrebooted too. I do have a number of things in /var/log/messages, but nothing that I can seee that would cause this problem. Before the video-card started flaking out, this puppy ran for weeks/months happily. AFAIW, X (or a heavily-loaded system) shouldn't have aynything to do with this=20 problem, [yes/no??]. Any clues, Garrett?=20 Ah, wait: dmesg.yesterday says=20 rl0: link state changed to UP pid 729 (Xorg), uid 0: exited on signal 6 (core dumped) pid 4475 (Xorg), uid 0: exited on signal 6 (core dumped) pid 60174 (firefox-bin), uid 1000: exited on signal 11 (core dumped) pid 47564 (as), uid 0: exited on signal 11 (core dumped) pid 47570 (as), uid 0: exited on signal 11 (core dumped) pid 79051 (as), uid 0: exited on signal 11 (core dumped) pid 79057 (as), uid 0: exited on signal 11 (core dumped) pid 3625 (as), uid 0: exited on signal 11 (core dumped) pid 3631 (as), uid 0: exited on signal 11 (core dumped) pid 74013 (conftest), uid 0: exited on signal 12 (core dumped) This file is timestamped 03 Oct 07 at 03:17 Anybody know why firefox would core dump? I have no clue waht "conftest" is... . Grant, how oten has your system failed? gary Gary, I have owned this server since new (in 2004), and this is the first = time it has done this. I also have another PE750 that was bought and = deployed the same time as this one and it has never done this. I am not running anything graphical on this, so I am guessing its not = the built in video card. It is running as a server only. Apache 2, = Mysql, 4PHP4, Perl5, Exim4, vm-pop3d, ipa, Openwebmail, and a number of = add in modules for all the above. One thing I may have neglected in my original post, is that it appears = the system may have been locked for a while since the last log entry I = can find befor the reboot was at about 12:20 am, the system then shows = the reboot at about 1:20 AM. -Grant --=20 Gary Kline kline@thought.org www.thought.org Public Service Unix http://jottings.thought.org http://transfinite.thought.org -------------------------------------------------------------------------= ----- Total Control Panel Login =20 To: gpeel@thenetnow.com Message Score: 50 High (60): Pass =20 From: kline@tao.thought.org My Spam Blocking Level: High = Medium (75): Pass =20 Low (90): Pass=20 Block messages from this sender (blacklist) =20 =20 This message was delivered because the content filter score did = not exceed your filter level. =20