From owner-freebsd-current@freebsd.org Mon Aug 14 06:19:14 2017
Date: Mon, 14 Aug 2017 08:19:10 +0200
From: "O. Hartmann"
To: freebsd-current, freebsd-ports
Subject: [poudriere] poudriere non-responsive, zombie sh
Message-ID: <20170814081910.27abe60a@freyja.zeit4.iv.bundesimmobilien.de>
Organization: Walstatt
List-Id: Discussions about the use of FreeBSD-current

I am running FreeBSD 12.0-CURRENT #87 r322472: Sun Aug 13 21:59:36 CEST 2017
amd64, with a jail of the same revision. Lately the poudriere build system has
started to become unresponsive when I hit Ctrl-C or, very often, stalls while
listing which packages are deleted or have to be rebuilt due to changed
dependencies. Usually the list of deleted/to-be-rebuilt packages shows up and
then the output flows on as packages are built. Now it stops somewhere in the
middle of that output.

Checking the box via ps/top, I then see an "sh" eating up a tremendous portion
of the CPU time. I have a 4-core/8-thread Xeon (Ivy Bridge based) with 16 GB of
RAM, using ZFS on a RAIDZ for the poudriere data (which never caused problems
in the past).

After hitting Ctrl-C, only two jails are left that do not die; I have to use
"poudriere jail -k" to kill the jail. But even then the zombie shell (sh)
remains, eating CPU time - I have no idea what the shell is doing. This strange
behaviour has appeared within the last two weeks on several poudriere hosts at
the same time, with unchanged configurations that worked before. If I wait
long enough - in some cases hours! - the shell finally dies (after Ctrl-C).
I haven't checked whether the poudriere jobs run to completion in the
background when no progress is shown on the terminal; I got impatient after a
couple of hours and stopped them.

It seems an issue was introduced recently. Can someone shed some light on it?
The problem is erratic - I cannot easily reproduce it, and I also cannot say
whether it is a ZFS, shell, or kernel issue.

Kind regards,
Oliver