From owner-freebsd-net@FreeBSD.ORG Sat Jun 23 07:41:14 2007 Return-Path: X-Original-To: freebsd-net@freebsd.org Delivered-To: freebsd-net@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [69.147.83.52]) by hub.freebsd.org (Postfix) with ESMTP id 7576B16A400 for ; Sat, 23 Jun 2007 07:41:14 +0000 (UTC) (envelope-from jfvogel@gmail.com) Received: from wa-out-1112.google.com (wa-out-1112.google.com [209.85.146.180]) by mx1.freebsd.org (Postfix) with ESMTP id 3FECD13C4C1 for ; Sat, 23 Jun 2007 07:41:14 +0000 (UTC) (envelope-from jfvogel@gmail.com) Received: by wa-out-1112.google.com with SMTP id j37so1053808waf for ; Sat, 23 Jun 2007 00:41:13 -0700 (PDT) DKIM-Signature: a=rsa-sha1; c=relaxed/relaxed; d=gmail.com; s=beta; h=domainkey-signature:received:received:message-id:date:from:to:subject:cc:in-reply-to:mime-version:content-type:content-transfer-encoding:content-disposition:references; b=UM9oJrnFFUqb6xep5M2bj1cIzdX7x1qmK4EN/+s2KNk2Irxg2k58WWq3z/ezq4zzG/fquxC6v//eZHnoa/ads463NEi5ViFv+wL0V58BoKBrMO2HRg+I1UxFHKz3thOB1+MdvoVSScmQe3GioKtg/6MOzUjAtzBnIHsTPoUWb30= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=beta; h=received:message-id:date:from:to:subject:cc:in-reply-to:mime-version:content-type:content-transfer-encoding:content-disposition:references; b=T4lzq3c96hgtALA9daz6jAxlecVKrr7BUwcCgw/xW/tTFVUVtdLAC+v6QJhV+z/DMO3AjaMqYse6D59xOtn2LNE+a2IoH+kdVk3NDo7UiZ6iX77WkBxEwYVcf46Ukw8btFyqm7YuVRVhZzlIx/Rj+xokpop9JNnzVFS2Ip1fgV0= Received: by 10.114.13.1 with SMTP id 1mr3610601wam.1182584473903; Sat, 23 Jun 2007 00:41:13 -0700 (PDT) Received: by 10.114.103.14 with HTTP; Sat, 23 Jun 2007 00:41:13 -0700 (PDT) Message-ID: <2a41acea0706230041p54212458mdd2ab339fe9bffd9@mail.gmail.com> Date: Sat, 23 Jun 2007 00:41:13 -0700 From: "Jack Vogel" To: "Andrew Snow" In-Reply-To: <467C8C5F.8050206@modulus.org> MIME-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Content-Disposition: inline References: <467C8C5F.8050206@modulus.org> Cc: freebsd-net@freebsd.org Subject: Re: watchdog timeout problem with freebsd 6.2-stable and v6.4.1 if_em driver X-BeenThere: freebsd-net@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Networking and TCP/IP with FreeBSD List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sat, 23 Jun 2007 07:41:14 -0000 On 6/22/07, Andrew Snow wrote: > > Hi, I have a problem with Pro/1000 cards in Freebsd, as follows: > > System: Supermicro 1RU server > CPU: Intel(R) Core(TM)2 CPU 4400 @ 2.00GHz > OS: FreeBSD 6.2-STABLE (Tue May 29 03:19:28 EST 2007) > amd64 (64 bit mode, SMP kernel) > > Driver: 6.4.1 kernel module (downloaded from Intel's website) > > em0: flags=8843 mtu 1500em0: > port > 0x5000-0x501f mem 0xe0300000-0xe031ffff irq 16 at device 0.0 on pci13 > em0: Ethernet address: 00:30:48:8b:54:30 > em1: port > 0x6000-0x601f mem 0xe0400000-0xe041ffff irq 17 at device 0.0 on pci14 > em1: Ethernet address: 00:30:48:8b:54:31 > > em0@pci13:0:0: class=0x020000 card=0x108c15d9 chip=0x108c8086 rev=0x03 > hdr=0x00 > vendor = 'Intel Corporation' > device = 'PRO/1000 PM' > class = network > subclass = ethernet > em1@pci14:0:0: class=0x020000 card=0x109a15d9 chip=0x109a8086 rev=0x00 > hdr=0x00 > vendor = 'Intel Corporation' > class = network > subclass = ethernet > > > > em1 is connected to another Pro/1000 network card and works properly > without any problems whatsoever! > > em0 is connected to a Netgear Gigabit SmartSwitch and all our cables are > Belkin CAT6 cables. We have TWO identical machines, and they both > suffer the "watchdog timeout" problem as follows: > > After medium-heavy traffic, the NIC locks up completely and no traffic > passes for a long time, perhaps longer than half an hour. > > Then, it recovers and prints this to syslog: > em0: watchdog timeout -- resetting > em0: link state changed to DOWN > em0: link state changed to UP > > The problem was much worse with the supplied drivers of FreeBSD 6.2, any > large packets would trigger it, but upgrading to 6.4.1 helped alot. > > The problem still happens with SMP completely disabled. The only > workaround which is completely successful for me so far is to disable > gigabit and use 100baseTX instead. > > The netgear switch is configured for VLANs but VLAN outbound tagging is > not enabled for this switch port and the em0 interface is not configured > for VLANs. > > > Any ideas? Bit by the 82573 eeprom MANC setting problem :( If you search thru the email archives you will find that I have posted a DOS patcher that fixes the problem. Search on 82573, if you have a problem let me know. What happens is that the eeprom mis-programs the MANC (management controller) register so it gobbles up packets rather than hand up to the host OS. The patcher should solve your problems, if not you know who to call :) Cheers, Jack