From owner-freebsd-stable@FreeBSD.ORG Tue Apr 22 20:14:08 2008
Date: Wed, 23 Apr 2008 00:13:44 +0400
From: pluknet
To: "Mike Tancsa"
In-Reply-To: <200804221807.m3MI73bN092981@lava.sentex.ca>
References: <20080421094718.GY25623@hub.freebsd.org>
 <200804211537.m3LFbaZA086977@lava.sentex.ca>
 <200804221501.m3MF1guW092221@lava.sentex.ca>
 <200804221741.m3MHfYjO092795@lava.sentex.ca>
 <200804221807.m3MI73bN092981@lava.sentex.ca>
Cc: stable@freebsd.org
Subject: Re: nfs-server silent data corruption
List-Id: Production branch of FreeBSD source code

On 22/04/2008, Mike Tancsa wrote:
> At 02:00 PM 4/22/2008, Arno J. Klaassen wrote:
>
> > >
> > > Are you using the latest RELENG_7, or at least the latest version of
> > > nfe thats in RELENG_7 ?
> >
> >
> > Think so :
> >
>
> OK, and it is the latest RELENG_7 ? Or just the if_nfe.c file has been
> manually updated ? Also, you are using ULE or the 4BSD scheduler ? I still
> have 4BSD on the box I am testing on.

Hi,

I have the same problem with data corruption (with nfe on the nfs server
side), particularly when transferring large files.
Maybe this is related to the topic.

My simple test case:

truncate -s 1000m bigfile
^^ here I get a zero-filled file
cp bigfile /nfs/mounted
^^ here I get a not-at-all-zero-filled file after uploading to the nfs server

I looked at the corrupted file. It contains a few ranges filled with
non-zero bytes:

equal to zero?  real 4-byte value  offset
=========================================
not equal       1200355616         at pos=38797316
...  <-- this range contains garbage in every 4-byte word, omitted
not equal       3879749905         at pos=38813696
not equal       161160732          at pos=45613060
...  <-- ditto
not equal       575257183          at pos=45629440
not equal       1943682165         at pos=59768836
...  <-- ditto
not equal       2843639625         at pos=59785216
not equal       2653910121         at pos=60293124
...  <-- ditto
not equal       3462830780         at pos=60309504

Some info:
nfs server on 8-CURRENT as of Apr 17
nfs client on 7.0-STABLE as of Apr 12

dmesg | grep nfe
nfe0: port 0xe000-0xe007 mem 0xe2001000-0xe2001fff irq 20 at device 4.0 on pci0
miibus0: on nfe0
nfe0: Ethernet address: 00:04:61:6c:76:b1
nfe0: [FILTER]
nfe0: tx v1 error 0x6001
nfe0: tx v1 error 0x6001
nfe0: tx v1 error 0x6001
nfe0: tx v1 error 0x6001
nfe0: tx v1 error 0x6001
nfe0: tx v1 error 0x6001
nfe0: tx v1 error 0x6001
nfe0: tx v1 error 0x6001
nfe0: tx v1 error 0x6001
nfe0: tx v1 error 0x6001
nfe0: tx v1 error 0x6001
nfe0: tx v1 error 0x6001
^^^ These appear while cp'ing the file to the server.
(btw they do not appear with polling disabled, so that is probably another issue)

vmstat -i | grep nfe
irq20: nfe0 ohci0                      1          0

nfe0: flags=8843 metric 0 mtu 1500
        options=48
        ether 00:04:61:6c:76:b1
        inet 192.168.200.137 netmask 0xffffff00 broadcast 192.168.200.255
        media: Ethernet autoselect (100baseTX )
        status: active

I can reproduce the corruption regardless of whether polling is enabled.

nfe0@pci0:0:4:0: class=0x020000 card=0x10001695 chip=0x006610de rev=0xa1 hdr=0x00

wbr,
pluknet
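
For anyone who wants to repeat the check on their own setup: the listing of non-zero words above could be produced by something like the sketch below. This is not the tool used for the numbers in this message, only a minimal illustration. The function name find_nonzero_words, the chunk size, and the little-endian interpretation of each 4-byte word are all assumptions.

```python
import struct


def find_nonzero_words(path, chunk_size=1 << 20):
    """Yield (offset, value) for every non-zero 4-byte word in a file.

    Assumption: words are read as little-endian unsigned 32-bit integers.
    A trailing partial word, if any, is zero-padded before decoding.
    """
    if chunk_size % 4:
        raise ValueError("chunk_size must be a multiple of 4")
    offset = 0
    with open(path, "rb") as f:
        while True:
            chunk = f.read(chunk_size)
            if not chunk:
                break
            # Fast path: skip chunks that consist only of zero bytes.
            if chunk.count(0) != len(chunk):
                padded = chunk + b"\0" * (-len(chunk) % 4)
                for i, (value,) in enumerate(struct.iter_unpack("<I", padded)):
                    if value:
                        yield offset + 4 * i, value
            offset += len(chunk)
```

Run against the copy on the server, each (offset, value) pair can be printed as "not equal VALUE at pos=OFFSET"; a file that survived the transfer intact yields nothing.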