From owner-freebsd-questions@FreeBSD.ORG Tue Jan 1 16:33:08 2008 Return-Path: Delivered-To: freebsd-questions@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 0EF2A16A418 for ; Tue, 1 Jan 2008 16:33:08 +0000 (UTC) (envelope-from scrappy@hub.org) Received: from hub.org (hub.org [200.46.204.220]) by mx1.freebsd.org (Postfix) with ESMTP id BAD3313C4D3 for ; Tue, 1 Jan 2008 16:33:07 +0000 (UTC) (envelope-from scrappy@hub.org) Received: from localhost (unknown [200.46.204.184]) by hub.org (Postfix) with ESMTP id E1A3A11FDD1B; Tue, 1 Jan 2008 12:33:06 -0400 (AST) Received: from hub.org ([200.46.204.220]) by localhost (mx1.hub.org [200.46.204.184]) (amavisd-maia, port 10024) with ESMTP id 96393-09; Tue, 1 Jan 2008 12:33:06 -0400 (AST) Received: from fserv.hub.org (blk-7-245-234.eastlink.ca [71.7.245.234]) by hub.org (Postfix) with ESMTP id 55A1411FDCE4; Tue, 1 Jan 2008 12:33:06 -0400 (AST) Received: from [192.168.1.2] (unknown [192.168.1.2]) by fserv.hub.org (Postfix) with ESMTP id A5AD05F762; Tue, 1 Jan 2008 12:33:05 -0400 (AST) Date: Tue, 01 Jan 2008 12:32:07 -0400 From: "Marc G. Fournier" To: freebsd-questions@freebsd.org Message-ID: <59DD6CCE263ECD75A7283A7B@ganymede.hub.org> X-Mailer: Mulberry/4.0.8 (Linux/x86) MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 7bit Content-Disposition: inline Cc: jarrod@netleader.com.au, freebsd-stable@freebsd.org Subject: Nagios + 6.3-RELEASE == Hung Process X-BeenThere: freebsd-questions@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: User questions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 01 Jan 2008 16:33:08 -0000 -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 G'day ... Yesterday, I setup nagios to do some system monitoring ... installed the latest version from ports into a jail, so that I could easily move it around between machines as I upgrade, without losing data ... after about 30 minutes running, I get a second nagios process running (fork?) that takes up ch CPU time as is available, and just hangs there until I kill -9 it ... Figuring that it might be a problem with the jail (trying to access somethign that isn't available to the process in a jail), I moved it to the physical server level ... but, again, after ~30 minutes, its doing the same thing: # ps aux | grep nagios nagios 32065 73.2 0.1 10948 3516 ?? R 11:15AM 7:40.77 /usr/local/bin/nagios -d /usr/local/etc/nagios/nagios.cfg nagios 82120 0.0 0.1 10948 3580 ?? Ss 10:47AM 0:01.18 /usr/local/bin/nagios -d /usr/local/etc/nagios/nagios.cfg So, definitely not jail related ... I've tried to do a 'truss -p 32065', it just hangs. And: ktrace -f /tmp/output -p 32065 ... produces nothing: # kdump -f /tmp/output 32065 nagios PSIG SIGKILL SIG_DFL Once I kill -9 the process, a bunch of 'check_ping' processes start up and then things go back to normal ... My last kernel / world build on that box is: Mon Nov 12 06:43:30 AST 2007 After searching the 'Net a bit, came across this thread: That recommends modifying libmap.conf with: [/usr/local/bin/nagios] libpthread.so.2 libthr.so.2 libpthread.so libthr.so This seems to fix the problem on the physical server, and am currently testing it in the jail itself to make sure it fixes it there too ... Should this be something that is more prominently documented somewhere? Maybe in the port itself? azureus has similar problems that are fixed with entries in libmap.conf, so its not "just a nagios issue" ... - ---- Marc G. Fournier Hub.Org Networking Services (http://www.hub.org) Email . scrappy@hub.org MSN . scrappy@hub.org Yahoo . yscrappy Skype: hub.org ICQ . 7615664 -----BEGIN PGP SIGNATURE----- Version: GnuPG v2.0.4 (FreeBSD) iD8DBQFHemsH4QvfyHIvDvMRApUOAKCLRDnmRba6ho4St8qZ6U19V8yJ+wCghMBp Xph3ac9d7QsMjeKBMtmgkuw= =mXxF -----END PGP SIGNATURE-----