Skip site navigation (1)Skip section navigation (2)
Date:      Mon, 12 Jul 2004 00:19:20 +0100
From:      Paul Civati <paul@xciv.org>
To:        freebsd-stable@freebsd.org
Subject:   Re: SMP and NMI errors (4.10)
Message-ID:  <57409.1089587960@xciv.org>
In-Reply-To: Your message of "Sun, 11 Jul 2004 15:34:52 PDT." <20040711153233.D76940@carver.gumbysoft.com> 

next in thread | previous in thread | raw e-mail | index | archive | help
Doug White <dwhite@gumbysoft.com> wrote:

> NMIs sometimes get triggered for ECC corrections, which is why a
> memtester wouldn't see it. 

Doh, yeah.

> I think there is a kernel option somewhere that hooks
> NMI and attempts to get information from the platform as to what DIMM
> triggered it.

Can't see anything for that, and there is only one DIMM, I have a 
second one to go in that I can swap to test.

> Otherwise you might check the Event Log in the BIOS for ECC events.

Alas no BIOS event log for this mobo.

> > If I boot a uniprocessor kernel this problem doesn't occur.
> It might be temperature related then :)

I was hoping no-one would say that :)

This is a 1U rack mount and currently seems to run at about ~40 deg. C
on CPU1, ~30 deg. C on CPU2 and ~33 deg. C system temp.  First CPU 
is a little hot perhaps but I don't think too high to be a problem?

-Paul-



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?57409.1089587960>