From owner-freebsd-current@FreeBSD.ORG Thu Aug 14 19:55:42 2003 Return-Path: Delivered-To: freebsd-current@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 632DE37B401 for ; Thu, 14 Aug 2003 19:55:42 -0700 (PDT) Received: from mail.chesapeake.net (chesapeake.net [208.142.252.6]) by mx1.FreeBSD.org (Postfix) with ESMTP id 8279E43F75 for ; Thu, 14 Aug 2003 19:55:41 -0700 (PDT) (envelope-from jroberson@chesapeake.net) Received: from localhost (jroberson@localhost) by mail.chesapeake.net (8.11.6/8.11.6) with ESMTP id h7F2sKN35511; Thu, 14 Aug 2003 22:54:20 -0400 (EDT) (envelope-from jroberson@chesapeake.net) Date: Thu, 14 Aug 2003 22:54:20 -0400 (EDT) From: Jeff Roberson To: Adam Migus In-Reply-To: <3F3C4A46.50204@migus.org> Message-ID: <20030814225323.V12093-100000@mail.chesapeake.net> MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII cc: Andrew Gallatin cc: current@freebsd.org Subject: Re: make buildkernel hang with SCHED_ULE X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.1 Precedence: list List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 15 Aug 2003 02:55:42 -0000 On Thu, 14 Aug 2003, Adam Migus wrote: > Andrew Gallatin wrote: > > >Adam Migus writes: > > > Folks, > > > While doing some performance analysis (doing make -j5 buildkernel) > > > on a set of 14 kernels I've hit one using the SCHED_ULE scheduler > > > that hangs. It happens every time but not necessarily in the same > > > place in the make. > > > > > > ><...> > > > > > The hardware is a dual Xeon box. The kernel is SMP w/ SCHED_ULE > > > instead of SCHED_4BSD, the options required for diskless and the > > > following two options: > > > >You have machdep.hlt_logical_cpus: 1 in your sysctl output. [BTW, > >lots of people read this mail via the web archives at > >http://docs.freebsd.org/cgi/getmsg.cgi?fetch=1073654+0+current/freebsd-current, > >where its impossible to view mime; it would be MUCH better for us if > >appended things like stack traces and sysctl output rather then > >scrambling them for no reason] > > > >SCHED_ULE is incompatible with halting logical CPUs. Something about > >it does't know the core isn't running, so it schedules a job there > >which never runs, and then it gets confused. When I boot a 1 CPU P4 > >with an SMP kernel and machdep.hlt_logical_cpus=1, it hangs before > >making it to multiuser mode.. > > > >Try setting machdep.hlt_logical_cpus=0 (via sysctl now, and in > >/boot/loader.conf so it doesn't happen again). > > > > > >Drew > > > > > > Andrew, > WRT the mime thing. My apologies. It never occured to me as everyone I > know personally uses a "real" mail reader. I'd attached them simply to > keep the scrolling down and allow order independant viewing. Thanks for > the tip. I'll just read them in as plain text in the future. > > WRT the sysctl value. Thanks for the tip. Is this to be considered a > bug in SCHED_ULE? If the default is hlt_logical_cpus=1 I would think > the scheduler should be able to handle it or deal with it > appropriately. Perhaps ignoring the value, setting it to 0 internally > or even just putting a warning message on boot? After all, not everyone > RTFM's. :-) > The MD code does not currently export the status of the CPUs in any reliable way. ULE attempts to recognize halted CPUs but it is not able to due to other issues. I think john baldwin might be solving this for x86. If not I can take a stab at it again. > Thanks again, > > -- > Adam - Migus Dot Org (http://www.migus.org) > > > _______________________________________________ > freebsd-current@freebsd.org mailing list > http://lists.freebsd.org/mailman/listinfo/freebsd-current > To unsubscribe, send any mail to "freebsd-current-unsubscribe@freebsd.org" >