Skip site navigation (1)Skip section navigation (2)
Date:      Mon, 16 May 2011 22:48:46 +0300
From:      Andriy Gapon <avg@FreeBSD.org>
To:        John Hay <jhay@meraka.org.za>
Cc:        alc@FreeBSD.org, freebsd-stable@FreeBSD.org
Subject:   Re: MCA: CPU 0 UNCOR PCC DTLB L1 error
Message-ID:  <4DD17F9E.6000702@FreeBSD.org>
In-Reply-To: <20110516162319.GA58581@zibbi.meraka.csir.co.za>
References:  <20110510125220.GA88338@zibbi.meraka.csir.co.za>	<BANLkTik79gjQKsdrz_8mQdLc3e9KGiGzzQ@mail.gmail.com> <20110516162319.GA58581@zibbi.meraka.csir.co.za>

next in thread | previous in thread | raw e-mail | index | archive | help
on 16/05/2011 19:23 John Hay said the following:
> 
> I have applied the patch, but got another one today.

Can you please double-check that you indeed have the patch and you are running a
patched kernel.

> I still do not get
> a prompt or dump. :-( 

That could be expected, because MCE is not a software problem, but a hardware one.

> It just get stuck right after #4. If there is anything
> more that I can try, just ask.

Please try to disable superpages, put vm.pmap.pg_ps_enabled="0" in your loader.conf.

If the problem persists, then my guess would be that you have a genuine hardware
problem with your CPU.  Next thing to try would be to replace it.

If disabling superpages helps you, then please let us know.  But please do not
hurry with conclusions.

> #####################################################################
> MCA: Bank 0, Status 0xb600000000010015
> MCA: Global Cap 0x0000000000000106, Status 0x0000000000000004
> MCA: Vendor "AuthenticAMD", ID 0x500f10, APIC ID 0
> MCA: CPU 0 UNCOR PCC DTLB L1 error

To Alan, just in case: previously OVER bit was also set here.

> MCA: Address 0x808ace000
> 
> 
> Fatal trap 28: machine check trap while in user mode
> cpuid = 1; apic id = 01
> instruction pointer	= 0x43:0x80af206d5
> stack pointer	        = 0x3b:0x7fffffffb8e8
> frame pointer	        = 0x3b:0x809b92450
> code segment		= base 0x0, limit 0xfffff, type 0x1b
> 			= DPL 3, pres 1, long 1, def32 0, gran 1
> processor eflags	= interrupt enabled, IOPL = 0
> current process		= 22228 (initial thread)
> trap number		= 28
> panic: machine check trap
> cpuid = 1
> KDB: stack backtrace:
> #0 0xffffffff80608f6e at kdb_backtrace+0x5e
> #1 0xffffffff805d6917 at panic+0x187
> #2 0xffffffff808bf7c0 at trap_fatal+0x290
> #3 0xffffffff808bfda9 at trap+0x109
> #4 0xffffffff808a8084 at calltrap+0x8
> #####################################################################



-- 
Andriy Gapon



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?4DD17F9E.6000702>