From owner-freebsd-net@FreeBSD.ORG Wed Jul 30 15:06:40 2003 Return-Path: Delivered-To: freebsd-net@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id A31E737B401 for ; Wed, 30 Jul 2003 15:06:40 -0700 (PDT) Received: from relay.pair.com (relay.pair.com [209.68.1.20]) by mx1.FreeBSD.org (Postfix) with SMTP id E4ED143FBD for ; Wed, 30 Jul 2003 15:06:39 -0700 (PDT) (envelope-from silby@silby.com) Received: (qmail 63698 invoked from network); 30 Jul 2003 22:06:38 -0000 Received: from niwun.pair.com (HELO localhost) (209.68.2.70) by relay.pair.com with SMTP; 30 Jul 2003 22:06:38 -0000 X-pair-Authenticated: 209.68.2.70 Date: Wed, 30 Jul 2003 17:05:49 -0500 (CDT) From: Mike Silbersack To: peter.lai@uconn.edu In-Reply-To: <20030730215823.GA361@cowbert.2y.net> Message-ID: <20030730170024.K88169@odysseus.silby.com> References: <20030730215823.GA361@cowbert.2y.net> MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII cc: freebsd-net@freebsd.org Subject: Re: dc TX underrun leads to delayed crash X-BeenThere: freebsd-net@freebsd.org X-Mailman-Version: 2.1.1 Precedence: list List-Id: Networking and TCP/IP with FreeBSD List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 30 Jul 2003 22:06:41 -0000 On Wed, 30 Jul 2003, Peter C. Lai wrote: > I'm noticing on a moderately loaded system, that sometimes when the kernel > increases the TX threshold (/kernel: dc0: TX underrun -- increasing TX > threshold), a few minutes later, the system hardlocks requiring a reset. > This routinely happens when I'm streaming MP3s over the network and the box > suddenly hardlocks; after I go back to inspect the logs, the TX buffer underrun > is the only thing in the log before the start of the kernel reboot messages. > This is occuring on 4.8-STABLE as of July 7, 2003 on an AMD K6-2 500 with > 348 Mb RAM and VIA Apollo MVP3 chipset. When the lockups occur, the system > temperatures are below 40C, with little disk activity, moderate ram and cpu > usage; the NIC (linksys LNE-100TX A) is usually doing a steady 50K/s at this point. > -- > Peter C. Lai Rev 1.9.2.47 of if_dc.c (committed July 14th) should fix this problem for you. MBUF_STRESS_TEST showed similar symptoms as mbuf chain lengths were increased, which is how I detected the problem. Try grabbing the new if_dc.c: http://www.freebsd.org/cgi/cvsweb.cgi/~checkout~/src/sys/pci/if_dc.c?rev=1.9.2.47&content-type=text/plain And see how things go. Mike "Silby" Silbersack