From owner-freebsd-current@FreeBSD.ORG Fri Jan 13 14:38:00 2012 Return-Path: Delivered-To: freebsd-current@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id CB31F106564A for ; Fri, 13 Jan 2012 14:38:00 +0000 (UTC) (envelope-from cracauer@koef.zs64.net) Received: from koef.zs64.net (koef.zs64.net [IPv6:2001:470:1f0b:105e::1e6]) by mx1.freebsd.org (Postfix) with ESMTP id 82C3F8FC16 for ; Fri, 13 Jan 2012 14:38:00 +0000 (UTC) Received: from koef.zs64.net (koef.zs64.net [IPv6:2001:470:1f0b:105e::1e6]) by koef.zs64.net (8.14.5/8.14.4) with ESMTP id q0DEbC6U066593; Fri, 13 Jan 2012 14:37:12 GMT (envelope-from cracauer@koef.zs64.net) Received: (from cracauer@localhost) by koef.zs64.net (8.14.5/8.14.4/Submit) id q0DEbBdL066592; Fri, 13 Jan 2012 09:37:11 -0500 (EST) (envelope-from cracauer) Date: Fri, 13 Jan 2012 09:37:11 -0500 From: Martin Cracauer To: Daniel Braniss Message-ID: <20120113143711.GA62486@cons.org> References: <20120111165714.GA46594@cons.org> <20120111182110.GA75991@cons.org> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.4.2.3i Cc: Martin Cracauer , freebsd-current@freebsd.org, Stefan Bethke Subject: Re: Data corruption over NFS in -current X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 13 Jan 2012 14:38:00 -0000 More findings. Reminder, with the original report I found: - files for no reason changing ownership and group to root/ - data corruption as in inserting binary junk obviously from ports - data corruption as in malformed ascii text that might be a bug I have in my code that is only exposed in FreeBSD I ran the script on a Linux machine in the same situation again the same NFS server, it worked fine. I haven't look at blocksizes, NFS versions etc in play yet. I ran with oldnfs (reboot), which showed only the third problem. I re-ran with newfs (reboot) which worked (all three problems absent). I then started building ports/land/gcc47 at the same time as I re-started my crazy script and it too only a few seconds for an unexpected ownership to root to occur. My next steps are: - trying block sizes and other parameters, maybe use a different NFS version with the Linux client. My NFS server is newly upgraded to Linux kernel 3.1.5 - running my script on a FreeBSD host with local disk to see whether problem #3 is a general problem that appears or is exposed only on FreeBSD - capture tcpdump as mentioned earlier I will probably have to turn debug off since this script run is dominated by system time now and gets 10x slower as it is now. Martin -- %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% Martin Cracauer http://www.cons.org/cracauer/