Date:      Wed, 21 May 2008 08:13:34 -0700
From:      Chris Pratt <eagletree@hughes.net>
To:        Chris Pratt <eagletree@hughes.net>
Cc:        freebsd-questions@freebsd.org, Alan Gilmour <alandgilmour@gmail.com>
Subject:   Re: Server crashing, no explanations
Message-ID:  <3F46D058-8AEF-419B-AA78-B6E060FCAF92@hughes.net>
In-Reply-To: <7361201E-387B-44AA-BFE8-1AF2FE06380D@hughes.net>
References:  <38f284ee0805200717l7008e18fud9631bf80839ceb1@mail.gmail.com> <7361201E-387B-44AA-BFE8-1AF2FE06380D@hughes.net>

On May 21, 2008, at 8:05 AM, Chris Pratt wrote:

>
> On May 20, 2008, at 7:17 AM, Alan Gilmour wrote:
>
>> Hey all,
>>
>> We have recently been getting a lot of traffic to one of our sites.
>> During busy periods the CPU is consistently at 100% utilisation.
>> When this happens we have approx 150 apache threads, and the load
>> goes way above 15.
>>
>> However, recently the server has been auto-restarting (when under heavy
>> load) with no explanation in any logs. I've checked the console log,
>> messages, db logs, etc., but there is no mention of anything wrong.
>>
>> Brief server summary :
>>
>> FreeBSD 6.3-STABLE #0:
>> CPU: Intel(R) Xeon(TM) CPU 2.80GHz (2800.11-MHz 686-class CPU)
>>  Logical CPUs per core: 2
>> real memory  = 17716740096 (16896 MB)
>> avail memory = 16837763072 (16057 MB)
>> FreeBSD/SMP: Multiprocessor System Detected: 4 CPUs
>>
>> We tried installing mbmon, lmmon and healthd, but none seem to work.
>>
>> Anyone got any suggestions for other things we can try to detect why
>> the server is failing? Or other ways to check things like CPU temp and
>> memory status?
>
> We have experienced this since 6.x began and it's not hardware.
> It can be reproduced by moving the role to another similar server.
> When the role and the traffic (not necessarily the load) are moved,
> the problem goes away or, rather, transfers to the new box.
>
> Look at the thread named "zonealarm issues" on Freebsd-Net a

BIG CORRECTION: "zonelimit issues" (geez, I hadn't touched a
Windows product in 3 years, no idea where that came from,
sorry).


> couple of months ago. You may find it applies, but there aren't
> any answers there yet. I gather that people need more data
> collected. I have never figured out how to get a dump, though
> people have recommended things to try over the last couple of
> years. I was hoping 7.0 would be the solution but I'm told it's
> not.
>
> Reduce your traffic and the problem will go away. Splitting the
> traffic across more than one server is one way to do this. We increased
> our uptime drastically by doing this, but we still get hit hard enough
> at times to go down. During our low traffic periods of the year,
> we simply stay up all the time (in the hottest days of summer).
>
> By the way, the symptom I see is never an immediate reboot; it will
> hang for a reasonable period of time prior to rebooting. As I
> monitor ours 24/7, I reset power on the box before it reboots to
> reduce the outage to customers. If I'm not watching, it eventually
> will reboot. Brutal but it works.
>
> Realize it's possible you don't have this problem but there are a
> few of us who do. It has something to do with buffers not being
> freed up.
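
One way to collect data on the "buffers not being freed up" theory is
to log the "denied" counters that netstat -m prints and see whether
they climb before a hang. Below is a minimal sketch in Python, assuming
the FreeBSD 6.x style of netstat -m output. The sample text, its
numbers, and the function names are made up for illustration, and
non-zero counters would only be a hint, not proof that this is the same
problem.

```python
import re

# Illustrative text in the style of `netstat -m` output.
# The numbers are invented for this example, not taken from a real box.
SAMPLE = """\
25590/10/25600 mbufs in use (current/cache/total)
25598/2/25600/25600 mbuf clusters in use (current/cache/total/max)
12/40/7 requests for mbufs denied (mbufs/clusters/mbuf+clusters)
0 requests for sfbufs denied
"""


def denied_counts(netstat_m_text):
    """Map each 'denied' line's description to its list of counters."""
    result = {}
    for line in netstat_m_text.splitlines():
        # Leading slash-separated integers, then a description that
        # contains the word "denied".
        m = re.match(r"\s*(\d+(?:/\d+)*)\s+(.*denied.*)$", line)
        if m:
            counters = [int(n) for n in m.group(1).split("/")]
            result[m.group(2).strip()] = counters
    return result


def any_denied(netstat_m_text):
    """True if any 'denied' counter in the text is non-zero."""
    return any(sum(v) > 0 for v in denied_counts(netstat_m_text).values())


if __name__ == "__main__":
    for description, counters in denied_counts(SAMPLE).items():
        print(description, counters)
    print("buffer requests denied:", any_denied(SAMPLE))
```

On a live system the text would come from running netstat -m itself
(for example from cron, appending to a log file) rather than from the
built-in sample.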
>
>>
>> Cheers
>>
>> Alan
>> _______________________________________________
>> freebsd-questions@freebsd.org mailing list
>> http://lists.freebsd.org/mailman/listinfo/freebsd-questions
>> To unsubscribe, send any mail to "freebsd-questions-unsubscribe@freebsd.org"
