Skip site navigation (1)Skip section navigation (2)
Date:      Tue, 30 Sep 2008 03:48:27 -0700
From:      Jeremy Chadwick <koitsu@FreeBSD.org>
To:        Bartosz Stec <admin@kkip.pl>
Cc:        freebsd-stable@freebsd.org, Oliver Lehmann <lehmann@ans-netz.de>
Subject:   Re: system hangup - I'm lost
Message-ID:  <20080930104827.GB44675@icarus.home.lan>
In-Reply-To: <48E201DF.5090001@kkip.pl>
References:  <20080929221408.54e6a03a.lehmann@ans-netz.de> <48E201DF.5090001@kkip.pl>

next in thread | previous in thread | raw e-mail | index | archive | help
On Tue, Sep 30, 2008 at 12:39:27PM +0200, Bartosz Stec wrote:
> Oliver Lehmann wrote:
>> Hi,
>>
>> My fileserver has sporadical hangups running 6.3:
>>
>> FreeBSD 6.3-STABLE #0: Thu Jun 19 00:21:00 CEST 2008
>>     olivleh1@nudel.salatschuessel.net:/usr/obj/i386-pentium3-6.3/usr/src/sys/NUDEL
>>
>> The exact release doesn't matter since it happened before. It always
>> happens afer some time of having some load on the system (I'm building
>> ports with tinderbox and during the build process it just hangs up).
>>
>> The system does nothing write out on the console, neither the CRT, nor
>> the serial console.
>>
>> The system itself is:
>>
>> CPU: Intel Pentium III (845.64-MHz 686-class CPU)
>>   Origin = "GenuineIntel"  Id = 0x683  Stepping = 3
>>   Features=0x387fbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,PN,MMX,FXSR,SSE>
>> real memory  = 805240832 (767 MB)
>> avail memory = 778481664 (742 MB)
>> ACPI APIC Table: <Intel  N440BX  >
>> FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs
>>  cpu0 (BSP): APIC ID:  1
>>  cpu1 (AP): APIC ID:  0
>> ioapic0 <Version 1.1> irqs 0-23 on motherboard
>>
>> while the diskspace is provided by an 3ware RAID:
>>
>> twa0: <3ware 9000 series Storage Controller> port 0x2400-0x24ff mem 0xf4101000-0xf41010ff,0xf4800000-0xf4ffffff irq 18 at device 11.0 on pci0
>> twa0: INFO: (0x04: 0x0053): Battery capacity test is overdue: twa0: 
>> INFO: (0x15: 0x1300): Controller details:: Model 9500S-4LP, 4 ports, 
>> Firmware FE9X 2.08.00.009, BIOS BE9X 2.03.01.052
>>
>> da0 at twa0 bus 0 target 0 lun 0
>> da0: <AMCC 9500S-4LP  DISK 2.08> Fixed Direct Access SCSI-3 device  
>> da0: 100.000MB/s transfers
>> da0: 715224MB (1464778752 512 byte sectors: 255H 63S/T 91178C)
>>
>> I had - in the past - sometimes messages left which where indicating,
>> that the system was not able to allocate swap space fast enough if I
>> recall it correctly (_not_ out of swap space!) but the RAID is kinda
>> fast imho.
>>
>>   Any idea what I could do to shed some more light on this behaviour?
>>   Why it is happening and what really is causing it?
>>   Would enabling the kernel debugger really help here? I mean the system
>>   is really hanging up - except ping response it is not responding to
>>   anything except the reset switch ;)
>>
>>    Greetings, Oliver
>>
>>
>>   
> Personally I'd rather bet on some hardware problem (overheating?) Try to  
> install mbmon from ports. I had also similiar problems with old  
> motherboards with swelled capacitors.

Be careful with mbmon and healthd -- just because they compile and run
does not mean they're working properly (the values shown may be
completely unreliable/incorrect).

It's best to check such things in the system BIOS, unless you have
absolute certainty that your motherboard is supported by mbmon/healthd.

-- 
| Jeremy Chadwick                                jdc at parodius.com |
| Parodius Networking                       http://www.parodius.com/ |
| UNIX Systems Administrator                  Mountain View, CA, USA |
| Making life hard for others since 1977.              PGP: 4BD6C0CB |




Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20080930104827.GB44675>