Skip site navigation (1)Skip section navigation (2)
Date:      Thu, 15 Jan 2009 15:58:56 +0000
From:      Pete French <petefrench@ticketswitch.com>
To:        rwatson@FreeBSD.org
Cc:        freebsd-stable@freebsd.org, drosih@rpi.edu, rblayzor.bulk@inoc.net
Subject:   Re: Big problems with 7.1 locking up :-(
Message-ID:  <E1LNUcS-0001dY-5e@dilbert.ticketswitch.com>
In-Reply-To: <alpine.BSF.2.00.0901151545001.45995@fledge.watson.org>

next in thread | previous in thread | raw e-mail | index | archive | help
> Given the inconsistency of the symptoms, I wouldn't preclude something 
> environmental: could it be that it was the bottom, or more likely, top box in 
> a rack and that your air conditioning isn't quite as effective there when the 
> outside temperature is above/below some threshold?

It's a possibility - but the two machines which were exhibiting the fault
are in Slough and Baton Rouge respectively, so under very diferent cliatic
conditions. Howevere, something, has chhnaged to make it stop locking up!
The USA one was doing it every couple of hours at the start of the week, and
the UK on wouldnt last more than half an hour at one point.

> Alternatively, could it be that the workload changed very slightly -- you're
> doing less DNS queries, or the network latency to the DNS server changed?

Also a possibility - that workload is entirely dependent on customer behaviour
which is an unpredictable beast!

> Certainly, whoever gave the advise on checking BIOS revisions is right: you 
> can spend a lot of time tracking down a bug to realize that one box has a 
> slightly different BIOS rev and therefore does/doesn't suffer from an obscure 
> SMI bug.

Yes, thats next on my list - make sure they are all on the same version.

> In any case, if it starts to reproduceably recur, send out mail and we can see
> if we can track it down some more.  BTW, did you establish if the version of 
> iLo you have has a remote NMI?  I seem to recall that some do, and being able 
> to deliver an NMI is really quite valuable.

OK, thanks. My iiLO2 appears to have the ability to generate an NMI oon
demand, so that could be used if/whhen the fault crops up again.

thanks, will let this lie for now and resurrect the thread when I can
get some more useful data.

-pete.



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?E1LNUcS-0001dY-5e>