From owner-freebsd-current@FreeBSD.ORG Fri Nov 13 08:09:41 2009 Return-Path: Delivered-To: freebsd-current@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 472C81065672; Fri, 13 Nov 2009 08:09:41 +0000 (UTC) (envelope-from avg@icyb.net.ua) Received: from citadel.icyb.net.ua (citadel.icyb.net.ua [212.40.38.140]) by mx1.freebsd.org (Postfix) with ESMTP id 5365E8FC1D; Fri, 13 Nov 2009 08:09:39 +0000 (UTC) Received: from porto.topspin.kiev.ua (porto-e.starpoint.kiev.ua [212.40.38.100]) by citadel.icyb.net.ua (8.8.8p3/ICyb-2.3exp) with ESMTP id KAA23020; Fri, 13 Nov 2009 10:09:16 +0200 (EET) (envelope-from avg@icyb.net.ua) Received: from localhost.topspin.kiev.ua ([127.0.0.1]) by porto.topspin.kiev.ua with esmtp (Exim 4.34 (FreeBSD)) id 1N8rDX-000Gfn-R5; Fri, 13 Nov 2009 10:09:15 +0200 Message-ID: <4AFD140D.7010407@icyb.net.ua> Date: Fri, 13 Nov 2009 10:08:45 +0200 From: Andriy Gapon User-Agent: Thunderbird 2.0.0.23 (X11/20090823) MIME-Version: 1.0 To: Kai Gallasch References: <1031257439203@webmail57.yandex.ru> <941257966918@webmail42.yandex.ru> <200911111504.14906.jhb@freebsd.org> <20091112195932.5875387e@orwell.free.de> In-Reply-To: <20091112195932.5875387e@orwell.free.de> X-Enigmail-Version: 0.96.0 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Cc: "S.N.Grigoriev" , freebsd-current@freebsd.org Subject: Re: 8.0RC2 amd64 - kernel panic running make buildworld X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 13 Nov 2009 08:09:41 -0000 on 12/11/2009 20:59 Kai Gallasch said the following: > sonnenkraft:~ # MCA: CPU 4 UNCOR PCC OVER DTLB L1 error Kai, very interesting info, it matches what Serguey reported too, thank you for the test! So in all cases where MCE information is captured it seems to be L1 data TLB error. John, BTW, OVER may be incorrectly reported by hardware in this case, see erratum 60: http://www.amd.com/us-en/assets/content_type/white_papers_and_tech_docs/41322.pdf Kai, I have a hunch, could you please try the following _sledgehammer_ patch (only kernel build/install is needed): diff --git a/sys/amd64/amd64/pmap.c b/sys/amd64/amd64/pmap.c index 44b71f3..a456609 100644 --- a/sys/amd64/amd64/pmap.c +++ b/sys/amd64/amd64/pmap.c @@ -2981,6 +2981,7 @@ setpte: * Map the superpage. */ pde_store(pde, PG_PS | newpde); + pmap_invalidate_all(pmap); pmap_pde_promotions++; CTR2(KTR_PMAP, "pmap_promote_pde: success for va %#lx" This will slow down an act of promotion to a superpage, but should not have any visible impact on overall performance. Serguey, you problem seems to not be limited to superpages only, so I am not sure if this patch would be of much help to you. -- Andriy Gapon