From owner-freebsd-current@FreeBSD.ORG Tue Aug 24 14:04:44 2004 Return-Path: Delivered-To: freebsd-current@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 1302616A4CF; Tue, 24 Aug 2004 14:04:44 +0000 (GMT) Received: from fledge.watson.org (fledge.watson.org [204.156.12.50]) by mx1.FreeBSD.org (Postfix) with ESMTP id ADD6543D2D; Tue, 24 Aug 2004 14:04:43 +0000 (GMT) (envelope-from robert@fledge.watson.org) Received: from fledge.watson.org (localhost [127.0.0.1]) by fledge.watson.org (8.12.11/8.12.11) with ESMTP id i7OE2UMX099974; Tue, 24 Aug 2004 10:02:30 -0400 (EDT) (envelope-from robert@fledge.watson.org) Received: from localhost (robert@localhost)i7OE2UBT099971; Tue, 24 Aug 2004 10:02:30 -0400 (EDT) (envelope-from robert@fledge.watson.org) Date: Tue, 24 Aug 2004 10:02:30 -0400 (EDT) From: Robert Watson X-Sender: robert@fledge.watson.org To: Lukas Ertl In-Reply-To: <20040824141238.K3016@pcle2.cc.univie.ac.at> Message-ID: MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII cc: freebsd-current@FreeBSD.org Subject: Re: PLEASE TEST: IPI deadlock avoidance patch X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.1 Precedence: list List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 24 Aug 2004 14:04:44 -0000 On Tue, 24 Aug 2004, Lukas Ertl wrote: > On Mon, 23 Aug 2004, Lukas Ertl wrote: > > > On Sun, 22 Aug 2004, Doug White wrote: > > > >> If you have a reasonably fast i386 or amd64 multiprocessor and/or > >> hyperthreading machine and are experiencing reproducible hangs during -j > >> buildwords and other highly parallel operations, please try this patch: > >> > >> http://people.freebsd.org/~dwhite/smp_rv_mtx.patch > > > > Looks good here - several of my Xeon boxes that were subject to deadlocks > > have survived -j4 buildworlds and other load situations with this patch. > > OK, it seems I shouted out too early. '-jX' buildworlds go fine, but my > news server (heavy disk and network traffic) still locks up within one > day of uptime. It's a Dual Xeon with HTT, SCHED_4BSD, ADAPTIVE_GIANT, > and no WITNESS or INVARIANTS. Using a serial break, are you able to get into the debugger on hang? If not, have you tried SW_WATCHDOG or MP_WATCHDOG? Robert N M Watson FreeBSD Core Team, TrustedBSD Projects robert@fledge.watson.org Principal Research Scientist, McAfee Research