From owner-freebsd-stable@FreeBSD.ORG Thu Oct 9 05:12:16 2008 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 7D1A01065686 for ; Thu, 9 Oct 2008 05:12:16 +0000 (UTC) (envelope-from jdc@koitsu.dyndns.org) Received: from QMTA09.westchester.pa.mail.comcast.net (qmta09.westchester.pa.mail.comcast.net [76.96.62.96]) by mx1.freebsd.org (Postfix) with ESMTP id 27E968FC1E for ; Thu, 9 Oct 2008 05:12:15 +0000 (UTC) (envelope-from jdc@koitsu.dyndns.org) Received: from OMTA14.westchester.pa.mail.comcast.net ([76.96.62.60]) by QMTA09.westchester.pa.mail.comcast.net with comcast id QUSS1a0021HzFnQ59VCFey; Thu, 09 Oct 2008 05:12:15 +0000 Received: from koitsu.dyndns.org ([69.181.141.110]) by OMTA14.westchester.pa.mail.comcast.net with comcast id QVCE1a00A2P6wsM3aVCEiv; Thu, 09 Oct 2008 05:12:15 +0000 X-Authority-Analysis: v=1.0 c=1 a=2tN0PPoFp_AA:10 a=QycZ5dHgAAAA:8 a=jnS-VZhvUVw0oufeq4IA:9 a=q1JqK9q-0ePhKuKIobsA:7 a=WF6cMMWmMW9_gmKaNlZF9rnvMvAA:4 a=EoioJ0NPDVgA:10 a=SV7veod9ZcQA:10 a=rC2wZJ5BpNYA:10 a=LY0hPdMaydYA:10 Received: by icarus.home.lan (Postfix, from userid 1000) id 3BB19C9419; Wed, 8 Oct 2008 22:12:14 -0700 (PDT) Date: Wed, 8 Oct 2008 22:12:14 -0700 From: Jeremy Chadwick To: bf Message-ID: <20081009051214.GA94941@icarus.home.lan> References: <20081008183652.GA83351@icarus.home.lan> <501797.33750.qm@web39105.mail.mud.yahoo.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <501797.33750.qm@web39105.mail.mud.yahoo.com> User-Agent: Mutt/1.5.18 (2008-05-17) Cc: freebsd-stable@freebsd.org Subject: Re: Recent Problems with RELENG_7 i386 X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 09 Oct 2008 05:12:16 -0000 On Wed, Oct 08, 2008 at 10:00:32PM -0700, bf wrote: > > > > --- On Wed, 10/8/08, Jeremy Chadwick wrote: > > > From: Jeremy Chadwick > > Subject: Re: Recent Problems with RELENG_7 i386 > > To: "bf" > > Cc: freebsd-stable@freebsd.org > > Date: Wednesday, October 8, 2008, 2:36 PM > > On Wed, Oct 08, 2008 at 10:19:47AM -0700, bf wrote: > > > After updating to RELENG_7 i386 of this weekend, I > > have been having problems > > > with my machine. When booting normally, the system > > slows or hangs at the > > > login prompt. If I am able to continue past the > > prompt, I sometimes experience > > > erratic mouse behavior, and a subsequent hang, after > > varying lengths of time, > > > even under light workloads. The same problem does not > > seem to occur in > > > single-user mode, and did not occur with the RELENG_7 > > i386 of just over a > > > week ago. I have been unable to obtain crashdumps so > > far, and the only > > > log messages I can find that weren't present > > before are notices like those > > > recorded below: > > > > > > Oct 8 11:00:40 myhost kernel: t_delta > > 15.fd80bdcb75b60200 too short > > > > This comes from src/sys/kern/kern_tc.c, around line 908. > > I'm not > > familiar with the kernel, but two ideas come to mind: > > > > 1) If you have Intel SpeedStep (EIST) or AMD > > Cool'n'Quiet enabled in > > your BIOS, try disabling it, > > > > 2) If you're using powerd, disable it (I don't see > > it enabled), > > > > 3) Try keeping HZ at 1000 (the default). > > > > Thanks, Jeremy, for taking the time to consider my question and reply. > > My CPU is pre-Cool'n'Quiet, and as far as I can tell I had disabled > all forms of power management that may affect the clock speeds. I have > found that by raising kern.hz to 250, or by using the default, I no > longer receive the t_delta is too short messages, and the other problems > are no longer apparent. My question is: why did this occur now? I don't know. We can't rewind time and find out system parameters and kernel details from 6 months ago. :-) I'm thinking it might have something to do with the timecounter selected by the kernel, but as I said, we can't rewind time to find out what things were in the past. The kernel environment variables I'm talking about are kern.timecounter. "sysctl kern.timecounter" could help shed some light here, maybe. It would at least allow us to see what timecounters are available on your system, and if a bad/unreliable one is being selected automatically. > I have been using a similar configuration for months now without any > apparent problems. My original goal in using a lower kern.hz was to > avoid burdening my machine with excessive context switching. This is over my head, technically. I would need to pull John Baldwin into this, since he knows a bit about both (timecounters and context switching). I'm just a simple caveman..... :-) > I saw the relevant section of kern_tc.c before I wrote my first > message, but when skimming through the changes in RELENG_7 over the > past week or two, I couldn't see any commit that may have directly > affected kernel timekeeping. Has some new workload been imposed on > the system by recent changes, that may have made a kern.hz of 100 > insufficient? Is this tuneable setting properly implemented, so that > all parts of the base system are using it's current value rather than > the default? Could some of my hardware, such as my RTC, be > malfunctioning? Well, I believe HZ was increased from 100 to 1000 long ago (RELENG_6?) as a default. I'm really not sure of the implications of decreasing it, besides having less granularity for some things (the only things I know of would be something pertaining to firewalls, I just can't remember what. My brain is full. :-) ) -- | Jeremy Chadwick jdc at parodius.com | | Parodius Networking http://www.parodius.com/ | | UNIX Systems Administrator Mountain View, CA, USA | | Making life hard for others since 1977. PGP: 4BD6C0CB |