Date: Thu, 1 Jun 2006 21:06:29 -0400
From: Kris Kennaway
To: Mark Morley
Cc: freebsd-fs@freebsd.org, freebsd-stable@freebsd.org
Subject: Re: NFS processes locking up!!
Message-ID: <20060602010629.GC40143@xor.obsecurity.org>
In-Reply-To: <447f640d-11663@helpdesk.islandnet.com>

On Thu, Jun 01, 2006 at 03:02:53PM -0700, Mark Morley wrote:
> Hi all,
>
> We have an NFS server (amd64) running FreeBSD 6.1-STABLE. It serves a dozen
> or so clients which are a mix of FreeBSD 4.11 and 6.1-STABLE.
> All NFS traffic is on a dedicated gigabit switched network.
>
> Periodically we have a problem where it will stop serving up files. Running 'ps'
> on the server shows a number of processes stuck in the 'D' state -- "a process in
> disk (or other short term, uninterruptible) wait".
>
> Usually this includes all the nfsd processes as well as any others that are trying
> to access the same disk drive. Any commands issued like 'du', 'sync', etc. go into
> the same state and never exit. It is impossible to kill any of these processes.
>
> We can pretty much force this to happen by running a large 'find' or something
> similar on the exported file system, although it will happen by itself eventually
> without any such commands being run.
>
> Our only option (as far as we can tell) is to reboot the server, which results in
> a very long fsck period (it's over a terabyte of disk space).
>
> This doesn't seem to be a hardware issue. This is a brand new server in all respects
> (all new hardware, new RAID) and we saw the exact same issue on the machine that it
> replaced (which was running 4.11 on i386).
>
> Any thoughts on this? Any more info I should provide?

Please post your kernel config.

Kris