From owner-freebsd-stable@FreeBSD.ORG Mon Jan 19 21:09:37 2009
Return-Path:
Delivered-To: stable@freebsd.org
Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34])
	by hub.freebsd.org (Postfix) with ESMTP id 26E881065676
	for ; Mon, 19 Jan 2009 21:09:37 +0000 (UTC)
	(envelope-from cpghost@cordula.ws)
Received: from fw.farid-hajji.net (fw.farid-hajji.net [213.146.115.42])
	by mx1.freebsd.org (Postfix) with ESMTP id 7B3C18FC0C
	for ; Mon, 19 Jan 2009 21:09:36 +0000 (UTC)
	(envelope-from cpghost@cordula.ws)
Received: from phenom.cordula.ws (phenom [192.168.254.60])
	by fw.farid-hajji.net (Postfix) with ESMTP id 16ACB36824;
	Mon, 19 Jan 2009 21:54:05 +0100 (CET)
Date: Mon, 19 Jan 2009 21:56:24 +0100
From: cpghost
To: Pete Carah
Message-ID: <20090119205624.GA1375@phenom.cordula.ws>
References: <200901190230.n0J2U5S5002220@port2.altadena.net>
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
In-Reply-To: <200901190230.n0J2U5S5002220@port2.altadena.net>
User-Agent: Mutt/1.5.18 (2008-05-17)
Cc: stable@freebsd.org
Subject: Soekris 4801 hangs (was Re: Hangs, maybe a clue)
X-BeenThere: freebsd-stable@freebsd.org
X-Mailman-Version: 2.1.5
Precedence: list
List-Id: Production branch of FreeBSD source code
List-Unsubscribe: ,
List-Archive:
List-Post:
List-Help:
List-Subscribe: ,
X-List-Received-Date: Mon, 19 Jan 2009 21:09:37 -0000

On Sun, Jan 18, 2009 at 09:30:05PM -0500, Pete Carah wrote:
> I've had some mysterious hangs which I notice that several others have too.
> Two of the machines in question are Soekris 4801's running as routers; this
> is hard to handle ddb with (though possible for one of them...) I started
> noticing this sometime in December. My laptop finally hung in a state where
> I could do a ps (waiting a long time for the response.) The strange and
> likely related to the hang was softdepflush in R state with 43 MINUTES of
> cpu. (the machine has been up maybe an hour.)

I'm seeing those hangs on Soekris 4801 routers running RELENG_7 as
well. The boxes are used as SoHo appliances and run mpd5, pf, named,
postfix, cyrus-imapd, lighttpd, openntpd and sshd. On all of them,
softupdates are enabled on all partitions except root, and they use
real HDDs (not compact flash).

The hangs now occur every 2 or 3 days, at different times of day. They
don't seem related to traffic type (heavy p2p, normal upload/download,
or idle) and also seem independent of disk activity (i.e. there are no
more hangs around 3am than at any other time).

They were less frequent before Dec 1st, and IIRC the last RELENG_7
that was nearly hang-free was 2008-11-07.

IMHO, it could be some kind of resource leak (?), but I'm not sure.
Since the only serial port is used by getty, I'm not sure how to
break into the debugger or how to trace the problem (and I'm not
experienced enough for this). :(

> -- Pete

Regards,
-cpghost.

-- 
Cordula's Web. http://www.cordula.ws/