Skip site navigation (1)Skip section navigation (2)
Date:      Sat, 15 Jul 2006 01:08:36 -0300 (ADT)
From:      User Freebsd <freebsd@hub.org>
To:        Kostik Belousov <kostikbel@gmail.com>
Cc:        freebsd-stable@freebsd.org, Robert Watson <rwatson@freebsd.org>, Michel Talon <talon@lpthe.jussieu.fr>, Francisco Reyes <lists@stringsutils.com>
Subject:   Re: vm_map.c lock up (Was: Re: NFS Locking Issue)
Message-ID:  <20060715010607.L1799@ganymede.hub.org>
In-Reply-To: <20060715035308.GJ32624@deviant.kiev.zoral.com.ua>
References:  <E1FxzUU-000MMw-5m@cs1.cs.huji.ac.il> <20060705100403.Y80381@fledge.watson.org> <cone.1152136419.991036.72616.1000@zoraida.natserv.net> <20060705234514.I70011@fledge.watson.org> <20060715000351.U1799@ganymede.hub.org> <20060715035308.GJ32624@deviant.kiev.zoral.com.ua>

next in thread | previous in thread | raw e-mail | index | archive | help
On Sat, 15 Jul 2006, Kostik Belousov wrote:

> On Sat, Jul 15, 2006 at 12:10:29AM -0300, User Freebsd wrote:
>>
>>
>> On Wed, 5 Jul 2006, Robert Watson wrote:
>>
>>> If you can get into DDB when the hang has occurred, output via serial
>>> console for the following commands would be very helpful:
>>>
>>> show pcpu
>>> show allpcpu
>>> ps
>>> trace
>>> traceall
>>> show locks
>>> show alllocks
>>> show uma
>>> show malloc
>>> show lockedvnods
>>
>> 'k, after 16 days uptime, the server that I got all the debugging turned
>> on for finally hung up solid ... I was able to break into DDB over the
>> serial link, and have run all of the above on it ... and the output is
>> attached ...
>>
>> One thing to note is that the ps listing is not complete ... there are >6k
>> processes running at the time, and I don't know how to get rid of the
>> '--more--' prompt :(  After 1k processes, I just hit 'q' and went onto the
>> other commands ...
> set lines=0
>>
>> Also, traceall gave me a 'No such command' error ... now that I think
>> about it, my luck, it was supposed to be 'trace all'?
> It is alltrace.
>>
>> If this doesn't provide enough information, please let me know what else I
>> should do the next time through, besides the above commands ...
> Missing alltrace output seems to be critical. If this is not feasible,
> please, provide at least the output of the bt <pid> for each pid
> shown in the "show lockedvnods" and "show alllocks". In you case,
> bt 64880 was the most interesting. It is pity that you had reset the
> machine.

Was down for too long as it was ... it, of course, happened while I was 
out with the family :(

Will keep all of this in mind next time I get a chance to run through 
things ...

Any idea why 'panic' doesn't produce core like it used to?

> Just in case, do you use mlocked mappings ? Also, why so huge number of 
> crons exist in the system ? The are all forking now. It may be (can not 
> say definitely without further investigation) just a fork bomb.

mlocked mappings?  What are they? :)

re: crons ... this, I'm not sure of, but my suspicion was that the crons 
weren't able to complete, since the file system was locked up, but the 
next one was being attempted to run ... *shrug*

----
Marc G. Fournier           Hub.Org Networking Services (http://www.hub.org)
Email . scrappy@hub.org                              MSN . scrappy@hub.org
Yahoo . yscrappy               Skype: hub.org        ICQ . 7615664



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20060715010607.L1799>