From: Stephen Montgomery-Smith
Date: Tue, 29 Nov 2005 23:50:06 -0600
To: Dan Charrois
Cc: freebsd-stable@freebsd.org
Subject: Re: FreeBSD unstable on Dell 1750 using SMP?
Message-ID: <438D3D8E.3010609@math.missouri.edu>

Dan Charrois wrote:
> It actually may be a comfort, since perhaps HTT is related to the
> culprit.  Since the last crash, about a month ago, I disabled HTT, both
> in the kernel as well in the BIOS.  So as far as I know, it's
> completely been disabled (and the boot messages and top only show 2
> CPUs).  And I haven't had the system go down for nearly a month now.

I don't know if it is related, but I used to have random reboots on a
dual Xeon system with HTT enabled.  It happened when I ran a
CPU-intensive threaded program at the same time as "top": running
"top -s0" (which you have to do as root) could usually kill the machine
within seconds, or minutes at most.

All I can tell you is that with FreeBSD 6.0 the problem disappeared.
Well, not totally: I still get a bunch of harmless calcru negative
messages, although I don't know if they are actually related to the
reboot problems I used to have with FreeBSD 5.4, because I get the
calcru backwards messages even with HTT disabled.

Anyway, if you are in the mood to try it out, you might like to try
re-enabling HTT, starting up whatever process you usually use (I'm
guessing it is MySQL), and then running "top -s0".  If you get a crash
soon after that, you have the same problem I had.

Let me also add that these crashes usually did not trigger a crash dump
(I had dumpon set), and when they did, the resulting dump looked rather
corrupted.

Stephen