From owner-freebsd-bugs@FreeBSD.ORG Sat Nov 29 00:28:11 2003 Return-Path: Delivered-To: freebsd-bugs@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id B36A716A4CE; Sat, 29 Nov 2003 00:28:11 -0800 (PST) Received: from geminix.org (gen129.n001.c02.escapebox.net [213.73.91.129]) by mx1.FreeBSD.org (Postfix) with ESMTP id 7075D43F75; Sat, 29 Nov 2003 00:28:10 -0800 (PST) (envelope-from gemini@geminix.org) Message-ID: <3FC85896.1080508@geminix.org> Date: Sat, 29 Nov 2003 09:28:06 +0100 From: Uwe Doering Organization: Private UNIX Site User-Agent: Mozilla/5.0 (X11; U; FreeBSD i386; en-US; rv:1.5) Gecko/20031019 X-Accept-Language: en-us, en MIME-Version: 1.0 To: freebsd-gnats-submit@FreeBSD.org References: <01b401c3b46d$a4ed5cc0$62c4033e@clarity> <20031127140905.GA95486@walton.maths.tcd.ie> <003301c3b617$3d0581e0$62c4033e@clarity> In-Reply-To: <003301c3b617$3d0581e0$62c4033e@clarity> Content-Type: text/plain; charset=us-ascii; format=flowed Content-Transfer-Encoding: 7bit Received: from gemini by geminix.org with asmtp (TLSv1:AES256-SHA:256) (Exim 3.36 #1) id 1AQ0SW-0001TS-00; Sat, 29 Nov 2003 09:28:09 +0100 cc: freebsd-bugs@freebsd.org cc: freebsd-stable@freebsd.org Subject: Re: kern/59719 Re: 4.9 Stable Crashes on SuperMicro with SMP X-BeenThere: freebsd-bugs@freebsd.org X-Mailman-Version: 2.1.1 Precedence: list List-Id: Bug reports List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sat, 29 Nov 2003 08:28:11 -0000 Jonathan Gilpin wrote: > I've run memtest (memtest86.com) kindly provided by Don and it passed all > the tests. I've installed installed a kernel module to test for memory > errors and found that again no memory errors are found... So this means it's > either a problem with the CPU's or a geniune bug in the kernel. (bugger!) No, that's unfortunately not what it means. If a memory test fails you can draw the conclusion that you have bad memory, but this doesn't work the other way round. If a memory test passes there is still a possibility that a memory chip is the culprit since memory test software cannot find all errors. Also, there is the chip set on the mainboard that coordinates bus access etc. for the two CPUs. Mainboard and chip set developers are known to make errors, too. In this case you would have to swap the entire mainboard, possible with one from a different manufacturer. I can tell you from my own experience that it is really hard to find reliable PC hardware these days, in light of ever shorter and faster product release cycles. Uwe -- Uwe Doering | EscapeBox - Managed On-Demand UNIX Servers gemini@geminix.org | http://www.escapebox.net