From owner-freebsd-alpha@FreeBSD.ORG Thu Aug 18 23:10:17 2005 Return-Path: X-Original-To: freebsd-alpha@freebsd.org Delivered-To: freebsd-alpha@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id C55CD16A41F for ; Thu, 18 Aug 2005 23:10:17 +0000 (GMT) (envelope-from ticso@cicely12.cicely.de) Received: from srv1.cosmo-project.de (srv1.cosmo-project.de [213.83.6.106]) by mx1.FreeBSD.org (Postfix) with ESMTP id 2F0FF43D45 for ; Thu, 18 Aug 2005 23:10:16 +0000 (GMT) (envelope-from ticso@cicely12.cicely.de) Received: from cicely5.cicely.de (cicely5.cicely.de [10.1.1.7]) (authenticated bits=0) by srv1.cosmo-project.de (8.12.10/8.12.10) with ESMTP id j7INABBU001124 (version=TLSv1/SSLv3 cipher=EDH-RSA-DES-CBC3-SHA bits=168 verify=OK); Fri, 19 Aug 2005 01:10:14 +0200 (CEST) (envelope-from ticso@cicely12.cicely.de) Received: from cicely12.cicely.de (cicely12.cicely.de [10.1.1.14]) by cicely5.cicely.de (8.12.10/8.12.10) with ESMTP id j7IN9I5C003990 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO); Fri, 19 Aug 2005 01:09:19 +0200 (CEST) (envelope-from ticso@cicely12.cicely.de) Received: from cicely12.cicely.de (localhost [127.0.0.1]) by cicely12.cicely.de (8.12.11/8.12.11) with ESMTP id j7IN9ITX092739; Fri, 19 Aug 2005 01:09:18 +0200 (CEST) (envelope-from ticso@cicely12.cicely.de) Received: (from ticso@localhost) by cicely12.cicely.de (8.12.11/8.12.11/Submit) id j7IN9Hbe092738; Fri, 19 Aug 2005 01:09:17 +0200 (CEST) (envelope-from ticso) Date: Fri, 19 Aug 2005 01:09:17 +0200 From: Bernd Walter To: harrisb@rcisd.org Message-ID: <20050818230916.GD90999@cicely12.cicely.de> References: <20050818111551.GP77387@cicely12.cicely.de> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: X-Operating-System: FreeBSD cicely12.cicely.de 5.2-CURRENT alpha User-Agent: Mutt/1.5.9i X-Spam-Status: No, hits=-4.9 required=3.0 tests=BAYES_00 autolearn=ham version=2.64 X-Spam-Report: * -4.9 BAYES_00 BODY: Bayesian spam probability is 0 to 1% * [score: 0.0000] X-Spam-Checker-Version: SpamAssassin 2.64 (2004-01-11) on cicely12.cicely.de Cc: ticso@cicely.de, freebsd-alpha@freebsd.org Subject: Re: machine check on 4100 5.4-RELEASE X-BeenThere: freebsd-alpha@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list Reply-To: ticso@cicely.de List-Id: Porting FreeBSD to the Alpha List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 18 Aug 2005 23:10:18 -0000 On Thu, Aug 18, 2005 at 10:24:42AM -0500, harrisb@rcisd.org wrote: > Here's the panic info: > > -------------------------------------------- > Mounting root from ufs:/dev/da0a > > unexpected machine check: > > mces = 0x1 > vector = 0x670 > param = 0xfffffc0000004e10 > pc = 0xfffffc000072faa8 > ra = 0xfffffc000072bba4 > curproc = 0xfffffc002e7cc000 > pid = 693, comm = perl5.8.6 > I could be completly wrong, but vector 0x670 (on AS4100) should be CPU detected errors, such as cache failure - either B-Cache or internal, which points to a CPU module. Are they always the same, or will they happen at random times? In any case you should check the SRM error log - might be something interesting in there. If they are logged they should appear as MCHK 670 events. > On Wed, Aug 17, 2005 at 04:53:28PM -0500, harrisb@rcisd.org wrote: > > All of a sudden, I'm getting regular crashes with machine check's. > > > > I've pulled one of the 533 CPU's, which didn't help, and now am > > wondering if it's possible that my instance of Mysql with all it's > > unaligned errors > > could possibly cause it crash? I've stopped the mysql daemon for a > > while just to see if it stabilizes. Anyone have any ideas? > > > > It will crash after it's been up for days, and then immediately after > > reboot. > > Details about the machine checks would be interesting. > > Unaligned errors in userland are corrected or the appplication is > terminated, depending on configuration. > Only unaligned faults inside the kernel are fatal. > > > I keep thinking hardware, but all the srm test fine. > > Hard- and software is possible, but without further details this is > hard to say. -- B.Walter BWCT http://www.bwct.de bernd@bwct.de info@bwct.de