From owner-freebsd-questions@FreeBSD.ORG Wed Jan 2 22:16:21 2008 Return-Path: Delivered-To: freebsd-questions@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 9196916A417; Wed, 2 Jan 2008 22:16:21 +0000 (UTC) (envelope-from jarrod@netleader.com.au) Received: from wallace.netleader.com.au (wallace.netleader.com.au [203.122.246.247]) by mx1.freebsd.org (Postfix) with ESMTP id C113813C4CC; Wed, 2 Jan 2008 22:16:20 +0000 (UTC) (envelope-from jarrod@netleader.com.au) Received: from gromit.local (gromit.local [192.168.0.3]) by wallace.netleader.com.au (8.14.2/8.14.2) with ESMTP id m02LlKae089128 (version=TLSv1/SSLv3 cipher=AES128-SHA bits=128 verify=NO); Thu, 3 Jan 2008 08:17:22 +1030 (CST) (envelope-from jarrod@netleader.com.au) Message-Id: <1DB78354-EBA2-43D0-A2D6-EFDA4950135B@netleader.com.au> From: Jarrod Sayers To: Tom Judge In-Reply-To: <477BAD2B.8070603@tomjudge.com> Content-Type: text/plain; charset=US-ASCII; format=flowed; delsp=yes Content-Transfer-Encoding: 7bit Mime-Version: 1.0 (Apple Message framework v915) Date: Thu, 3 Jan 2008 08:17:20 +1030 References: <59DD6CCE263ECD75A7283A7B@ganymede.hub.org> <477A72B8.8010307@protected-networks.net> <477BAD2B.8070603@tomjudge.com> X-Mailer: Apple Mail (2.915) Cc: "Marc G. Fournier" , Michael Butler , freebsd-stable@freebsd.org, freebsd-questions@freebsd.org Subject: Re: Nagios + 6.3-RELEASE == Hung Process X-BeenThere: freebsd-questions@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: User questions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 02 Jan 2008 22:16:21 -0000 On 03/01/2008, at 1:56 AM, Tom Judge wrote: > I have also seen this issue, but have always put it down to the way > that > we manage our nagios deployments with cfengine. I will try to deploy > this change and monitor for the problem to see if it persists. I hope I can confirm your frustrations. There is a threading issue with Nagios when it's binaries are linked against libpthread(3) threading library, the default on recent FreeBSD 5.x releases and all 6.x releases. The issue is random and extremely difficult to track down with the symptoms being a second Nagios process sitting on the system hanging a CPU. Be rest assured that I have been working on it, and have seen it on one system of mine. Changes have been submitted for net-mgmt/nagios-devel (aka Nagios 3.0.r1)) to force the build process to link against libthr(3) where available, removing the need to map libpthread() out with /etc/ libmap.conf. If this goes well, as stated in the PR, i'll back-port it to net-mgmt/nagios (aka Nagios 2.10) in the next few days. If anyone out there is running net-mgmt/nagios-devel and feels like trying it for me, see ports/119246 and drop me an email with a before and after "ldd /usr/local/bin/nagios". > On a side note if you want to use broker modules with nagios from port > you need to change the following in the port Makefile in order to make > them load properly: > > From: > USE_AUTOTOOLS= autoconf:259 > To: > SE_AUTOTOOLS= autoconf:259 libltdl:15 > > I sent an email to the maintainer but got no response and my email did > not seem to have affected the last commit to upgrade to 2.10 I did receive that email and the changes went in with the last commit of net-mgmt/nagios-devel to test. No issues have arisen so i'll be back-porting it to net-mgmt/nagios soon for you. There also has been a rather large ports freeze which delayed the upgrade to Nagios 2.10, that PR was submitted on the 1st of November and committed on the 13th of December. Unfortunately your email fell somewhere in the middle, apologies for not letting you know. Jarrod.