Skip site navigation (1)Skip section navigation (2)
Date:      Sat, 25 Dec 2010 16:27:37 +0300
From:      Sergey Kandaurov <pluknet@gmail.com>
To:        Carl Johnson <carlj@peak.org>
Cc:        freebsd-stable@freebsd.org
Subject:   Re: MCA messages after upgrade to 8.2-BEAT1
Message-ID:  <AANLkTin_77Y=hYDVq7UYqGAq=FwwWFKqVZt6O%2BKKU2Tp@mail.gmail.com>
In-Reply-To: <87r5d6e9vb.fsf@oak.localnet>
References:  <4D11F1F5.7050902@quip.cz> <201012220957.26854.jhb@freebsd.org> <4D13A579.8090004@langille.org> <AANLkTi=yWh5_0anXyYs4u9KiKpZnC%2BqW%2BAG8YF1Ogbt2@mail.gmail.com> <87vd2iepkv.fsf@oak.localnet> <AANLkTin_2PsAL-N3Sct-S9Do7qCnR6-Dubwt1GOKgD-r@mail.gmail.com> <87r5d6e9vb.fsf@oak.localnet>

next in thread | previous in thread | raw e-mail | index | archive | help
On 25 December 2010 07:48, Carl Johnson <carlj@peak.org> wrote:
> Alan Cox <alan.l.cox@gmail.com> writes:
>
>> On Fri, Dec 24, 2010 at 5:08 PM, Carl Johnson <carlj@peak.org> wrote:
>>
>>> Alan Cox <alan.l.cox@gmail.com> writes:
>>>
>>> > 2010/12/23 Dan Langille <dan@langille.org>
>>> >
>>> >> On 12/22/2010 9:57 AM, John Baldwin wrote:
>>> >>
>>> >>> On Wednesday, December 22, 2010 7:41:25 am Miroslav Lachman wrote:
>>> >>>
>>> >>>> Dec 21 12:42:26 kavkaz kernel: MCA: Bank 0, Status 0xd40e400000000=
833
>>> >>>> Dec 21 12:42:26 kavkaz kernel: MCA: Global Cap 0x0000000000000105,
>>> >>>> Status 0x0000000000000000
>>> >>>> Dec 21 12:42:26 kavkaz kernel: MCA: Vendor "AuthenticAMD", ID 0x40=
f33,
>>> >>>> APIC ID 0
>>> >>>> Dec 21 12:42:26 kavkaz kernel: MCA: CPU 0 COR OVER BUSLG Source DR=
D
>>> >>>> Memory
>>> >>>> Dec 21 12:42:26 kavkaz kernel: MCA: Address 0x236493c0
>>> >>>>
>>> >>>
>>> >>> You are getting corrected ECC errors in your RAM. =A0You see them o=
nce an
>>> >>> hour
>>> >>> because we poll the machine check registers once an hour. =A0If thi=
s
>>> happens
>>> >>> constantly you might have a DIMM that is dying?
>>> >>>
>>> >>
>>> >> John:
>>> >>
>>> >> I take it these ECC errors *may* have been happening for some time. =
What
>>> >> has changed is the OS now polls for the errors and reports them.
>>> >>
>>> >>
>>> > Yes, we enabled MCA by default in 8.1-RELEASE.
>>>
>>> Is there some reason that it is only available for i386 and not for
>>> amd64? =A0Linux has something called mcelog, for machine check errors,
>>> which sounds similar and is available for amd64.
>>>
>>>
>> Perhaps I'm misunderstanding your question, but our MCA driver is suppor=
ted
>> and enabled by default on both i386 and amd64.
>
> Thanks, it appears that I misunderstood. =A0I ran whereis and found
> /usr/src/sbin/mca and didn't find it on my amd64 system. =A0I do see it i=
n
> my sysctl listing now that I look there.
>

I guess that's designed for ia64 only
(at least there's no hw.mca.first on other arches).

--=20
wbr,
pluknet



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?AANLkTin_77Y=hYDVq7UYqGAq=FwwWFKqVZt6O%2BKKU2Tp>