From owner-freebsd-stable@FreeBSD.ORG Wed Nov 8 16:26:48 2006 Return-Path: X-Original-To: freebsd-stable@freebsd.org Delivered-To: freebsd-stable@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id DC2AB16A416; Wed, 8 Nov 2006 16:26:48 +0000 (UTC) (envelope-from scottl@samsco.org) Received: from pooker.samsco.org (pooker.samsco.org [168.103.85.57]) by mx1.FreeBSD.org (Postfix) with ESMTP id CABF443D4C; Wed, 8 Nov 2006 16:26:45 +0000 (GMT) (envelope-from scottl@samsco.org) Received: from [192.168.254.11] (phobos.samsco.home [192.168.254.11]) (authenticated bits=0) by pooker.samsco.org (8.13.4/8.13.4) with ESMTP id kA8GQSIS028271; Wed, 8 Nov 2006 09:26:33 -0700 (MST) (envelope-from scottl@samsco.org) Message-ID: <45520532.3000603@samsco.org> Date: Wed, 08 Nov 2006 09:26:26 -0700 From: Scott Long User-Agent: Mozilla/5.0 (Macintosh; U; Intel Mac OS X; en-US; rv:1.8.0.7) Gecko/20060910 SeaMonkey/1.0.5 MIME-Version: 1.0 To: Nikolay Pavlov , Jack Vogel , Adrian Chadd , freebsd-stable@freebsd.org References: <68011C68-0962-4946-88E1-F36EE7C707DA@redstarling.com> <20061106221219.GA66676@hugo10.ka.punkt.de> <041201c701f9$37b2aed0$9603a8c0@claylaptop> <2a41acea0611061614n478efe77y82c0ebc2e1b01e19@mail.gmail.com> <2a41acea0611062242h42b1bde6w711e9a5039ed1a90@mail.gmail.com> <20061108144003.GA43734@zone3000.net> <20061108154102.GA40238@icarus.home.lan> In-Reply-To: <20061108154102.GA40238@icarus.home.lan> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-Spam-Status: No, score=-1.4 required=3.8 tests=ALL_TRUSTED autolearn=failed version=3.1.1 X-Spam-Checker-Version: SpamAssassin 3.1.1 (2006-03-10) on pooker.samsco.org Cc: Subject: Re: em driver testing X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 08 Nov 2006 16:26:49 -0000 Jeremy Chadwick wrote: > On Wed, Nov 08, 2006 at 04:40:03PM +0200, Nikolay Pavlov wrote: >> Well i have 5.5 box with very similar symptomatic :) >> I do not see watchdog timeouts on it, but a lot of UP/DOWN events. > > Are you sure this is the same problem as what's being discussed > here? If you revert to a previous kernel or em driver, does the > problem (link up/down) go away? Are you sure you don't actually > have a flaky cable or RJ45 connector? What does the switch your > NIC is connected to say? (does it show link going up and down) > > I feel horrible for both Scott and Jack -- I think there's tons > of people coming out of the woodwork with "ME TOO" comments who > may in fact be suffering from other problems, and are looking for > a scapegoat thread. > The timeout/watchdog mechanism in the interface layer has been a problem ever since the MPSAFE work was done on the network stack. It's prone to races, and as the OS has improved and gotten faster over the past 2 years, those races have gotten bigger. In a way, it's a actually a positive indication of progress and improvement =-) I don't doubt that there are users with other problems. We spent some time collecting as much user data as we could in order to find patterns and weed out the uncommon cases. But this timer/watchdog thing looks to be a strong candidate for being the root cause of many of the problems. We'll continue to investigate these problems and address other drivers. Scott