From owner-freebsd-hackers@FreeBSD.ORG Mon Apr 16 06:33:22 2007 Return-Path: X-Original-To: freebsd-hackers@freebsd.org Delivered-To: freebsd-hackers@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [69.147.83.52]) by hub.freebsd.org (Postfix) with ESMTP id 81FAA16A402; Mon, 16 Apr 2007 06:33:22 +0000 (UTC) (envelope-from youshi10@u.washington.edu) Received: from mxout7.cac.washington.edu (mxout7.cac.washington.edu [140.142.32.178]) by mx1.freebsd.org (Postfix) with ESMTP id 5F55E13C43E; Mon, 16 Apr 2007 06:33:22 +0000 (UTC) (envelope-from youshi10@u.washington.edu) Received: from smtp.washington.edu (smtp.washington.edu [140.142.32.139]) by mxout7.cac.washington.edu (8.13.7+UW06.06/8.13.7+UW07.03) with ESMTP id l3G6XLHL002607 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=OK); Sun, 15 Apr 2007 23:33:21 -0700 X-Auth-Received: from [192.168.10.45] (c-24-7-142-221.hsd1.ca.comcast.net [24.7.142.221]) (authenticated authid=youshi10) by smtp.washington.edu (8.13.7+UW06.06/8.13.7+UW07.03) with ESMTP id l3G6XL2J021135 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NOT); Sun, 15 Apr 2007 23:33:21 -0700 Message-ID: <462318CB.3030205@u.washington.edu> Date: Sun, 15 Apr 2007 23:33:47 -0700 From: Garrett Cooper User-Agent: Thunderbird 1.5.0.10 (Windows/20070221) MIME-Version: 1.0 To: freebsd-hackers@freebsd.org References: <20070414184719.110deaa2@deimos.mars.bsd> <46217486.6080801@u.washington.edu> <20070415161753.7c7a604d@deimos.mars.bsd> In-Reply-To: <20070415161753.7c7a604d@deimos.mars.bsd> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-PMX-Version: 5.3.0.289146, Antispam-Engine: 2.5.0.283055, Antispam-Data: 2007.4.15.232233 X-Uwash-Spam: Gauge=IIIIIII, Probability=7%, Report='__CP_URI_IN_BODY 0, __CT 0, __CTE 0, __CT_TEXT_PLAIN 0, __HAS_MSGID 0, __MIME_TEXT_ONLY 0, __MIME_VERSION 0, __SANE_MSGID 0, __USER_AGENT 0' Cc: Alejandro Pulver Subject: Re: High disk load +mount/atacontrol/NFS/SMBFS crashes the system X-BeenThere: freebsd-hackers@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Technical Discussions relating to FreeBSD List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 16 Apr 2007 06:33:22 -0000 Alejandro Pulver wrote: > On Sat, 14 Apr 2007 17:40:38 -0700 > Garrett Cooper wrote: > >> Alejandro Pulver wrote: >>> Hello. >>> >>> I have experienced the following problem a couple of times in 2 >>> different machines and FreeBSD versions (see below): when the disk is >>> continuously reading/writing (like when copying/extracting a file, >>> checking the filesystem in the background, etc.) my system crashes >>> sometimes (it's not an everyday thing, but quite frustrating when it >>> happens). >>> >>> When copying from another machine by NFS/SMBFS more than one file at >>> the same time (or when using the disk, like described above) often >>> crashes (and the disk light indicator turns off). Running "atacontrol >>> ad0 mode UDMA100" when it was UDMA133 crashed the system (the disk >>> activity indicator was always on) when I tried to solve the problem >>> that way. Also when I was installing a port which installs many files >>> on the second machine without using NFS/SMBFS, trying to mount a local >>> NTFS filesystem (with kernel driver) crashed. >>> > [...] >> Ale, > > Hello. > > Thank you for your reply. > >> Could you provide more information about your machine, in particular >> the devices attached (lspci -vv from sysutils/pciutils does the trick) >> and the options enabled in your custom kernel please? > > Sure. I have updated the file (added pci_machine_1.txt and > pci_machine_2.txt). The kernel configuration is already there (named > ATHLON-PHOBOS), the second machine has a default SMP kernel. > > http://people.freebsd.org/~alepulver/disk-crash.tar.bz2 > >> Also, could you provide more information about what the settings are >> that you are using for NFS and SMBFS (-rsize, -wsize, special >> mountd/rpcbind options, etc). >> -Garrett > > I am not using nothing special here. In rc.conf: > > rpcbind_enable="YES" > nfs_server_enable="YES" > nfs_client_enable="YES" > > And the commands (at different times): > > # mount deimos:/wxp /mnt > # mount -t smbfs //administrator@mariana/c /mnt > > After both FreeBSD machines crashed when the problem happened (because > of the NFS waiting infinitely), I started using "-i". The second > command was to copy some data from a Windows machine. > > BTW I don't think the problem is related to NFS/SMBFS but to the disk > drivers, since it happens without them too. One is ATA (has an year) > and the other is SATA (new). However I am not experienced in this to > tell. > > Thanks and Best Regards, > Ale Ale, I'm not sure what's going on exactly based on the information you provided, but I would try the following steps to isolate the issue: 1) See if you can upgrade the first machine to a later version of FreeBSD, say 6.2. I believe that there were related issues resolved in 6.2, but my memory could be incorrect. See if your problems occur after that. 2) Try grabbing a different machine if possible and see if the same issue occurs when you put the new machine as server and client with one of the other machines. 3) Try switching roles with the 2 machines. If machine 1 is usually server, let it play client and vice versa with machine 2. 4) Remove the new drive if possible, see if issue goes away. If it does, try acquiring a cheap(er) drive and put it Also, it appears that another FreeBSD team member had a similar issue (see: http://people.freebsd.org/~pho/stress/log/cons205.html and http://people.freebsd.org/~pho/stress/log/cons225.html). I dunno how but it showed up as one of the leading searches on Google. It looks like a (localized) filesystem issue, but I'm not sure what it is exactly. -Garrett