Skip site navigation (1)Skip section navigation (2)
Date:      Sat, 11 Aug 2007 18:22:22 +0900
From:      Pyun YongHyeon <pyunyh@gmail.com>
To:        Don Lewis <truckman@FreeBSD.org>
Cc:        current@FreeBSD.org
Subject:   Re: bizarre nfe(4) problem
Message-ID:  <20070811092222.GA22569@cdnetworks.co.kr>
In-Reply-To: <200708110542.l7B5gJm9058171@gw.catspoiler.org>
References:  <200708110542.l7B5gJm9058171@gw.catspoiler.org>

next in thread | previous in thread | raw e-mail | index | archive | help
On Fri, Aug 10, 2007 at 10:42:19PM -0700, Don Lewis wrote:
 > I've a rather strange nfe(4) problem that appears to be repeatable.  I
 > recently started running -CURRENT on a older socket 754 motherboard with
 > the nForce3 chipset.  Initially, I was running an SMP kernel, but I had
 > problems with sporadic "nfe0: watchdog timeout (missed Tx interrupts) --
 > recovering" problems that would intermittently cause the system to lose
 > network connectivity which it would recover from.  The kernel was very
 > similar to GENERIC, with just the addition of "options DEBUG_VFS_LOCKS"
 > and the replacement of atapicd with atapicam.
 > 
 > The nfe0 problem totally went away when I removed "options SMP" and
 > "device apic" from the kernel configuration, except under the following
 > very specific circumstances:
 > 
 > 	A vncserver session using the GNOME desktop was started on the
 > 	system.
 > 	
 > 	There was no keyboard or mouse activity on the console for an
 > 	extended period of time, allowing the GNOME screen saver to kick
 > 	in and lock the screen.
 > 
 > The system would run fine in this state for many hours, and would accept
 > incoming SMTP connections, etc.
 > 
 > 	A remote vncclient makes a connection to the vncserver session
 > 	and the password was entired on the client.
 > 
 > At this point the nfe0 interface would appear to go deaf.  This might
 > happen before or slightly after the password dialog box appeared for the
 > vnc session.  For a short while, the system would be able to transmit
 > TCP packets, ntp queries, etc., but it would not respond to any incoming
 > packets (ping, TCP connection requests, etc.). Eventually, the ARP cache
 > would time out and the only packets being transmitted would be ARP
 > requests and the occasional UDP broadcast from the samba server running
 > on the machine.
 > 
 > Pressing any key on the (PS/2) keyboard would instantly bring the
 > network interface back to life.  Examination of /var/log/messages showed
 > lots of "nfe0: watchdog timeout" messages for the entire time that nfe0
 > was not listening to the network.
 > 
 > I've had this problem happen twice.  Both times were after an extended
 > period of console inactivity.   An incoming vnc connection is not
 > sufficient to trigger the problem if the console was recently active,
 > and even waiting for the GNOME screensaver to put the monitor in DPMS
 > power save mode before initiating the vnc connection does not appear to
 > be sufficient to trigger the problem.
 > 
 > I believe that nfe0 was sharing an interrupt with one of the USB ports
 > when the kernel was compiled with "device apic", but it is not sharing
 > an interrupt without "device apic".
 > 
 > Any thoughts on how to debug this problem?
 > 
 > 
 > # vmstat -i
 > interrupt                          total       rate
 > irq0: clk                       41903449       1000
 > irq1: atkbd0                       39034          0
 > irq3: ohci0                            5          0
 > irq7: ppc0                             2          0
 > irq8: rtc                        5362802        127
 > irq9: ohci1 ahc0+                1963559         46
 > irq10: nfe0+                      225593          5
              ^^
You have nfe0+ which indicates vmstat had run out of room to
display somthing. I'm not sure but it's still sharing interrupt
with other device?

 > irq11: drm0                      2511908         59
 > irq12: psm0                       332931          7
 > irq14: ata0                           48          0
 > Total                           52339331       1249
 > 

-- 
Regards,
Pyun YongHyeon



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20070811092222.GA22569>