Skip site navigation (1)Skip section navigation (2)
Date:      Wed, 20 Feb 2008 18:14:47 -0500
From:      "Alexandre \"Sunny\" Kovalenko" <alex.kovalenko@verizon.net>
To:        Daniel Eischen <eischen@vigrid.com>
Cc:        acpi@freebsd.org
Subject:   Re: How to disable acpi thermal?
Message-ID:  <1203549287.1019.43.camel@RabbitsDen>
In-Reply-To: <Pine.GSO.4.64.0802201711090.7855@sea.ntplx.net>
References:  <Pine.GSO.4.64.0801142156360.24324@sea.ntplx.net> <1200369199.2054.38.camel@RabbitsDen> <Pine.GSO.4.64.0801151525160.29868@sea.ntplx.net> <1203131179.833.32.camel@RabbitsDen> <Pine.GSO.4.64.0802201711090.7855@sea.ntplx.net>

next in thread | previous in thread | raw e-mail | index | archive | help

On Wed, 2008-02-20 at 17:15 -0500, Daniel Eischen wrote:
> On Fri, 15 Feb 2008, Alexandre "Sunny" Kovalenko wrote:
> 
> >
> > On Tue, 2008-01-15 at 15:34 -0500, Daniel Eischen wrote:
> >> [ Redirected from -current ]
> >>
> >> On Mon, 14 Jan 2008, Alexandre "Sunny" Kovalenko wrote:
> >>
> >>>
> >>> On Mon, 2008-01-14 at 21:56 -0500, Daniel Eischen wrote:
> >>>>
> >>>> Thermal zone 0 skyrockets past 110C in a couple of minutes
> >>>> when trying to build a kernel.  All the other zones stay
> >>>> relatively static.  I suspect something is wrong somewhere
> >>>> because this machine is very lightly loaded and has never
> >>>> had a problem until now.  I just upgraded it from 4.x to
> >>>> 7.0.
> >>>
> >>> It need not to be bogus -- if I turn off fan on my ThinkPad it will
> >>> overheat and shut itself down within couple of minutes of buildworld,
> >>> starting from the relative cool state. From the look of the stuff below
> >>> your fan should kick in no later then 10 seconds after tz0 reached 77C.
> >>> Do you hear it running before shutdown? If yes, maybe lowering threshold
> >>> in AC0 down from 77C will help. If not -- you will need to figure out
> >>> who is supposed to turn on the fan. You can dump your ASL (instructions
> >>> in the handbook) and post it someplace accessible -- I will take a look
> >>> and maybe spot something interesting, but, being far from the expert in
> >>> the field, I do not promise too much.
> >>
> >> I posted the acpidump here:
> >>
> >>    http://people.freebsd.org/~deischen/stl2.iasl
> >>
> >> The problem is that acpi_thermal keeps shutting down the system
> >> after 2 minutes into a buildkernel.  The system has no load other
> >> than the buildkernel at the time it shuts down.
> >>
> >> The system is a Intel STL2 Tupelo motherboard with 1 CPU, the
> >> other CPU socket being occupied by a CPU terminator thingy.
> >> I uncovered the rackmount system and watched it while building
> >> a kernel.  With the cover off the acpi monitored temperature
> >> went to 107C and stayed there.  It only took a minute or two
> >> to get there.  I felt around inside the chassis and nothing
> >> was even near being to warm or hot.  With the cover on, the
> >> temperature goes to 111/112C before being shutdown by acpi_thermal
> >> (the limit being 110C).  There is no way anything in that
> >> chassis is anywhere near 100C.  I've disabled acpi_thermal
> >> for now, but it'd be nice to get a better fix.
> >>
> >> Any ideas?
> >>
> > You can try this patch on your ASL, which might just cause passive
> > cooling to kick in. If you decide to try a patch, I would like to see
> > the output of
> 
> I guess I'm confused - how can passive cooling "kick in".  Isn't
> passive cooling always on if you are using a heatsink?
In the ACPI context (and please, bear with me -- I am no expert -- I
just read respective pieces of the spec and experimented with few
specimens of my own hardware) "passive" cooling is lowering of the CPU
frequency when temperature reaches given point, as denoted by
hw.acpi.thermal.tzN._PSV value. This will happen (I have tried ;)
regardless of the efforts of powerd to raise the frequency due to the
load history. This helps in the situations when CPU could not run at
maximum load for protracted periods of time.

I assume (possibly incorrectly) that 1) your CPU is capable of the
frequency throttling and 2) you are using frequency governor of some
sort (see cpufreq(4) for detail). If this is not the case, the change
will not help.

Also, since I have sent you that change, I have learned that setting
hw.acpi.thermal.user_override=1 and hw.acpi.thermal.tz0._PSV=85C might
accomplish the same thing as the ASL change. I saw it working for the
thermal zone which already had sensible _PSV, but I have no hardware to
try this approach when _PSV is not present in the ASL. 

> 
> > sysctl hw.acpi.thermal
> >
> > regardless of the outcome.
> >
> > OTOH, it just occurred to me that I have observed something like that on
> > my previous laptop. I used cheap thermal paste between the CPU and the
> > heatsink and I used a lot of it. Chassis were relatively cool and yet
> > CPU sensor hit critical trip point.
> 
-- 
Alexandre "Sunny" Kovalenko (Олександр Коваленко)




Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?1203549287.1019.43.camel>