From owner-freebsd-current@FreeBSD.ORG Tue Feb 14 14:09:36 2012 Return-Path: Delivered-To: current@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 9714B106567E; Tue, 14 Feb 2012 14:09:36 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from cyrus.watson.org (cyrus.watson.org [65.122.17.42]) by mx1.freebsd.org (Postfix) with ESMTP id 6E42B8FC22; Tue, 14 Feb 2012 14:09:36 +0000 (UTC) Received: from bigwig.baldwin.cx (bigwig.baldwin.cx [96.47.65.170]) by cyrus.watson.org (Postfix) with ESMTPSA id 25B3E46B3C; Tue, 14 Feb 2012 09:09:36 -0500 (EST) Received: from jhbbsd.localnet (unknown [209.249.190.124]) by bigwig.baldwin.cx (Postfix) with ESMTPSA id 8157CB966; Tue, 14 Feb 2012 09:09:35 -0500 (EST) From: John Baldwin To: TAKAHASHI Yoshihiro Date: Tue, 14 Feb 2012 09:09:33 -0500 User-Agent: KMail/1.13.5 (FreeBSD/8.2-CBSD-20110714-p10; KDE/4.5.5; amd64; ; ) References: <20120212.185549.343708041257811633.nyan@FreeBSD.org> <201202131308.48171.jhb@freebsd.org> <20120214.215750.343708041257791038.nyan@FreeBSD.org> In-Reply-To: <20120214.215750.343708041257791038.nyan@FreeBSD.org> MIME-Version: 1.0 Content-Type: Text/Plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit Message-Id: <201202140909.33894.jhb@freebsd.org> X-Greylist: Sender succeeded SMTP AUTH, not delayed by milter-greylist-4.2.7 (bigwig.baldwin.cx); Tue, 14 Feb 2012 09:09:35 -0500 (EST) Cc: current@freebsd.org Subject: Re: MCA UNCOR error X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 14 Feb 2012 14:09:36 -0000 On Tuesday, February 14, 2012 7:57:50 am TAKAHASHI Yoshihiro wrote: > In article <201202131308.48171.jhb@freebsd.org> > John Baldwin writes: > > > On Sunday, February 12, 2012 4:55:49 am TAKAHASHI Yoshihiro wrote: > >> I get the following error and kernel panic on my pc98 at the boot time. > >> > >> MCA: Bank 2, Status 0xb600000000140000 > >> MCA: Global Cap 0x0000000000000005, Status 0x0000000000000004 > >> MCA: Vendor "GenuineIntel", ID 0x616, APIC ID 0 > >> MCA: CPU 0 UNCOR PCC no error > >> MCA: Address 0x3446ff003446ff > >> > >> When I disable MCA with hw.mca.enabled=0 on loader prompt, the machine > >> works fine. Does it mean my pc98 is broken? Or other isssue? > >> > >> It's spec is: > >> > >> FreeBSD 10.0-CURRENT #5: Thu Feb 9 13:18:22 UTC 2012 > >> CPU: Pentium Pro (198.95-MHz 686-class CPU) > >> Origin = "GenuineIntel" Id = 0x616 Family = 6 Model = 1 Stepping = 6 > >> Features=0xf9ff > >> real memory = 134217728 (128 MB) > >> avail memory = 120471552 (114 MB) > > > > Interesting, that is odd to get an error with no error code. > > > > Can you tell from the stack trace if your CPU actually raised a machine check > > exception (trap 28)? > > I tested with debugger enabled kernel. > Please get from: > http://home.jp.freebsd.org/~nyan/mca-error1.jpg > http://home.jp.freebsd.org/~nyan/mca-error2.jpg > (Sorry for jpeg images) Humm. Try this: Index: x86/x86/mca.c =================================================================== --- mca.c (revision 231678) +++ mca.c (working copy) @@ -548,7 +548,12 @@ mca_scan(enum scan_mode mode) valid = mca_check_status(i, &rec); if (valid) { count++; - if (rec.mr_status & ucmask) { + /* + * Don't treat records that do not specify an + * error as unrecoverable. + */ + if ((rec.mr_status & MC_STATUS_MCA_ERROR) != 0 && + rec.mr_status & ucmask) { recoverable = 0; mca_log(&rec); } -- John Baldwin