From owner-freebsd-hackers@FreeBSD.ORG Mon Sep 24 09:33:15 2007 Return-Path: Delivered-To: freebsd-hackers@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 4598E16A419 for ; Mon, 24 Sep 2007 09:33:15 +0000 (UTC) (envelope-from kris@FreeBSD.org) Received: from weak.local (hub.freebsd.org [IPv6:2001:4f8:fff6::36]) by mx1.freebsd.org (Postfix) with ESMTP id 3F91613C448; Mon, 24 Sep 2007 09:33:12 +0000 (UTC) (envelope-from kris@FreeBSD.org) Message-ID: <46F78459.4060607@FreeBSD.org> Date: Mon, 24 Sep 2007 11:33:13 +0200 From: Kris Kennaway User-Agent: Thunderbird 2.0.0.6 (Macintosh/20070728) MIME-Version: 1.0 To: Borja Marcos References: In-Reply-To: Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Cc: freebsd-hackers@freebsd.org, Benjie Chen Subject: Re: Kernel panic on PowerEdge 1950 under certain stress load X-BeenThere: freebsd-hackers@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Technical Discussions relating to FreeBSD List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 24 Sep 2007 09:33:15 -0000 Borja Marcos wrote: > > On 22 Sep 2007, at 00:26, Benjie Chen wrote: > >> FreeBSD 6.2 on PowerEdge 1950, RAID1 setup with mfi driver (PERC5i). 4GB >> RAM. I am currently running i386, and not amd64, due to various reasons. >> >> Kernel panic is at 0xC066C731, which from nm shows it's in mtx_lock_spin >> c066c7b4 T _mtx_lock_spin >> c066c85c T _mtx_unlock_sleep >> >> So this could mean that independent stress tests will not result in >> panic if >> there aren't enough concurrency to cause the problem. > > I don't have the exact IP address involved, but we experienced > consistent panics in two heavily loaded mail servers (same hardware > models, Dell Powereedge) runnning Postfix and FreeBSD 6.2. > > Suspecting an issue with the IP stack and smp I tried to set > "debug.mpsafenet=0" and the problems are gone. Of course I've lost some > performance, but the systems have been solid for some weeks so far. What number is the PR with the details? Kris