From owner-freebsd-stable@FreeBSD.ORG Thu Oct 25 08:57:57 2007 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 1977D16A417 for ; Thu, 25 Oct 2007 08:57:57 +0000 (UTC) (envelope-from oleg.lomaka@gmail.com) Received: from gnome.kiev.sovam.com (gnome.kiev.sovam.com [212.109.32.24]) by mx1.freebsd.org (Postfix) with ESMTP id A934113C4B0 for ; Thu, 25 Oct 2007 08:57:56 +0000 (UTC) (envelope-from oleg.lomaka@gmail.com) Received: from relay02.kiev.sovam.com ([62.64.120.197]) by gnome.kiev.sovam.com with esmtp (Exim 4.67 (FreeBSD)) (envelope-from ) id 1IkyXI-000AUS-T0 for freebsd-stable@freebsd.org; Thu, 25 Oct 2007 11:57:54 +0300 Received: from [212.82.216.226] (helo=skuns.kiev.zoral.com.ua) by relay02.kiev.sovam.com with esmtps (TLSv1:AES256-SHA:256) (Exim 4.67) (envelope-from ) id 1IkyXG-000Cu4-Lf for freebsd-stable@freebsd.org; Thu, 25 Oct 2007 11:57:51 +0300 Received: from tdevil.lomaka.org.ua (fc2.kiev.zoral.com.ua [10.1.1.7]) (authenticated bits=0) by skuns.kiev.zoral.com.ua (8.14.1/8.14.1) with ESMTP id l9P8uqCw043098 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO); Thu, 25 Oct 2007 11:56:53 +0300 (EEST) (envelope-from oleg.lomaka@gmail.com) Message-ID: <47205A56.6010601@gmail.com> Date: Thu, 25 Oct 2007 11:56:54 +0300 From: Oleg Lomaka User-Agent: Thunderbird 2.0.0.6 (X11/20070819) MIME-Version: 1.0 To: pyunyh@gmail.com References: <20071024084934.GF11234@cdnetworks.co.kr> <471F52DC.4080305@gmail.com> <20071025020637.GA16092@cdnetworks.co.kr> <47203EC3.4010203@gmail.com> <20071025083032.GE16092@cdnetworks.co.kr> In-Reply-To: <20071025083032.GE16092@cdnetworks.co.kr> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-Virus-Scanned: ClamAV version 0.91.2, clamav-milter version 0.91.2 on skuns.kiev.zoral.com.ua X-Virus-Status: Clean X-Spam-Status: No, score=-4.4 required=5.0 tests=ALL_TRUSTED,BAYES_00 autolearn=ham version=3.2.3 X-Spam-Checker-Version: SpamAssassin 3.2.3 (2007-08-08) on skuns.kiev.zoral.com.ua X-Scanner-Signature: feb28a81ec0850514d099b7346a2573b X-DrWeb-checked: yes X-SpamTest-Envelope-From: oleg.lomaka@gmail.com X-SpamTest-Group-ID: 00000000 X-SpamTest-Info: Profiles 1669 [Oct 25 2007] X-SpamTest-Info: helo_type=3 X-SpamTest-Info: {received from trusted relay: not dialup} X-SpamTest-Method: none X-SpamTest-Method: Local Lists X-SpamTest-Rate: 0 X-SpamTest-Status: Not detected X-SpamTest-Status-Extended: not_detected X-SpamTest-Version: SMTP-Filter Version 3.0.0 [0255], KAS30/Release Cc: freebsd-stable@freebsd.org Subject: Re: any hope for nfe/msk? X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 25 Oct 2007 08:57:57 -0000 Pyun YongHyeon wrote: > On Thu, Oct 25, 2007 at 09:59:15AM +0300, Oleg Lomaka wrote: > > Hello, > > > > Pyun YongHyeon wrote: > > >On Wed, Oct 24, 2007 at 05:12:44PM +0300, Oleg Lomaka wrote: > > > > Pyun YongHyeon wrote: > > > > >On Wed, Oct 24, 2007 at 09:33:48AM +0200, Danny Braniss wrote: > > > > > > Hi, > > > > > > these drivers don't work under 7.0 > > > > > > As soon as some mild preasure is applied, they start loosing > > > > > interrupts, and > > > > > > in my case the hosts come to a total stand-still, since they are > > > > > diskless > > > > > > and rely on the network. > > > > > > This happens at 1gb and at 100mg. > > > > > > > > > > > > Maybe the problem is with the shared interrups? > > > > > > > > > > > > irq16: mskc0 uhci0 3308351 13 > > > > > > or > > > > > > irq21: nfe0 ohci0 1584415 24 > > > > > > > > > > > > but I have no idea how to uncouple this > > > > > > > > > > > > > > > >If you see watchdog timeout errors on your console, shared interrupt > > > > >would be culprit. > > > > >For msk(4) set hw.msk.legacy_intr="1" in loader.conf or use kenv(1) > > > > >to set it before loading msk(4) kernel module. > > > > >For nfe(4) you can switch to polling(4). > > > > > > > > > > > > > > I have some msk troubles too. On my laptop (acer travelmate 2483wxmi) > > > > under heavy cpu & network load msk periodically stops working for few > > > > minutes. > > > > > >If that happens msk(4) recover from the non-working state? > > > > > Yes, some times in few seconds, some times in 5 - 10 minutes, but always > > recovers. > > > > sysctl -a|grep msk > > > > <118>msk0: no link ... > > > > <118>DHCPREQUEST on msk0 to 255.255.255.255 port 67 > > > > <118>DHCPREQUEST on msk0 to 255.255.255.255 port 67 > > > > <118>DHCPDISCOVER on msk0 to 255.255.255.255 port 67 interval 3 > > > > <118>DHCPREQUEST on msk0 to 255.255.255.255 port 67 > > > > <118>msk0: flags=8843 metric 0 > > > > mtu 1500 > > > > msk0: watchdog timeout (missed Tx interrupts) -- recovering > > > > msk0: watchdog timeout (missed Tx interrupts) -- recovering > > > > msk0: Rx FIFO overrun! > > > ^^^^^^^^^^^^^^^^ > > >This looks bad. Would you show me verbosed boot messages related with > > >msk(4) and PHY driver as well as "vmstat -i" output. > > > > > > > > Here are values from just booted laptop. If it will halt msk today > > again, I'll resend. > > > > tdevil% vmstat -i > > interrupt total rate > > irq1: atkbd0 3275 1 > > irq12: psm0 11157 6 > > irq14: ata0 22500 13 > > irq15: ata1 85 0 > > irq16: mskc0 uhci+ 17334 10 > > irq18: uhci2 1 0 > > irq22: pcm0 46530 27 > > irq23: uhci0 ehci0 95882 57 > > cpu0: timer 3322705 1999 > > Total 3519469 2117 > > > > > > tdevil% grep -iE "msk|phy" /var/run/dmesg.boot > > pci0: domain=0, physical bus=0 > > pci2: domain=0, physical bus=2 > > mskc0: port 0x2000-0x20ff mem > > 0xd0100000-0xd0103fff irq 16 at device 0.0 on pci2 > > mskc0: Reserved 0x4000 bytes for rid 0x10 type 3 at 0xd0100000 > > mskc0: MSI count : 2 > > mskc0: RAM buffer size : 16KB > > mskc0: Port 0 : Rx Queue 10KB(0x00000000:0x000027ff) > > mskc0: Port 0 : Tx Queue 10KB(0x00002800:0x00004fff) > > msk0: on mskc0 > > msk0: bpf attached > > msk0: Ethernet address: 00:1b:24:0e:bc:26 > > miibus0: on msk0 > > e1000phy0: PHY 0 on miibus0 > > e1000phy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto > > ukphy0: PHY 3 on miibus0 > > ukphy0: OUI 0x001000, model 0x0004, rev. 0 > > ukphy0: no media present > > ukphy1: PHY 6 on miibus0 > > ukphy1: OUI 0x004400, model 0x0011, rev. 0 > > ukphy1: no media present > > mskc0: [MPSAFE] > > mskc0: [FILTER] > > pci3: domain=0, physical bus=3 > > pci4: domain=0, physical bus=4 > > pci5: domain=0, physical bus=5 > > pci10: domain=0, physical bus=10 > > > > Thanks for the info. Would please try attached patch? > > After kldunload/kldload i've got following and had to revert to original one (1.18 revision). I'll try to reboot laptop in the evening with your patch. Is kernel reloading desirable? found-> vendor=0x104c, dev=0x8039, revid=0x00 domain=0, bus=10, slot=9, func=0 class=06-07-00, hdrtype=0x02, mfdev=1 cmdreg=0x0007, statreg=0x0210, cachelnsz=16 (dwords) lattimer=0x31 (1470 ns), mingnt=0x44 (17000 ns), maxlat=0x03 (750 ns) intpin=a, irq=20 powerspec 2 supports D0 D1 D2 D3 current D0 pci0:10:9:0: reprobing on driver added found-> vendor=0x104c, dev=0x803b, revid=0x00 domain=0, bus=10, slot=9, func=2 class=01-80-00, hdrtype=0x00, mfdev=1 cmdreg=0x0006, statreg=0x0210, cachelnsz=16 (dwords) lattimer=0x39 (1710 ns), mingnt=0x07 (1750 ns), maxlat=0x04 (1000 ns) intpin=a, irq=20 powerspec 2 supports D0 D1 D2 D3 current D0 pci0:10:9:2: reprobing on driver added msk0: on mskc0 msk0: failed to allocate DMA'able memory for jumbo buf device_attach: msk0 attach returned 1