Skip site navigation (1)Skip section navigation (2)
Date:      Sat, 23 Oct 2010 09:33:20 +0000 (UTC)
From:      "Bjoern A. Zeeb" <bzeeb-lists@lists.zabbadoz.net>
To:        Mike Tancsa <mike@sentex.net>
Cc:        Randy Bush <randy@psg.com>, stable@freebsd.org, Jack Vogel <jfvogel@gmail.com>
Subject:   Re: repeating crashes with 8.1
Message-ID:  <20101023091555.W66242@maildrop.int.zabbadoz.net>
In-Reply-To: <201010230821.o9N8LVuR001382@lava.sentex.ca>
References:  <m2zku7cqt5.wl%randy@psg.com> <m2y69rcqjc.wl%randy@psg.com> <201010221416.o9MEGSa0094817@lava.sentex.ca> <m2tykeb9ac.wl%randy@psg.com> <201010221425.o9MEPcWC094867@lava.sentex.ca> <m2k4lab6nh.wl%randy@psg.com> <201010221848.o9MIm7WF096197@lava.sentex.ca> <m2y69q9e38.wl%randy@psg.com> <4CC1F3B8.3010302@bogus.com> <4CC225D3.1030502@ops-netman.net> <7.1.0.9.0.20101022210145.06fe25e8@sentex.net> <201010230159.o9N1xGGF098363@lava.sentex.ca> <AANLkTimWTTHWC04my3CSoNGYsLarS9F10eoO=8Fz37cF@mail.gmail.com> <201010230821.o9N8LVuR001382@lava.sentex.ca>

next in thread | previous in thread | raw e-mail | index | archive | help
On Sat, 23 Oct 2010, Mike Tancsa wrote:

Hi all,

> At 12:41 AM 10/23/2010, Jack Vogel wrote:
>> Odd, can you make any connection between this and the em complaints??
>
> I dont think so.

neither do I think so.  The initial issue

> This is on an igb nic and a different panic/behaviour. I 
> have the box sitting at the debugger prompt in the FreeBSD netperf cluster, 
> so hopefully someone can take a look and see what is the issue.

Ok, and I seem to have another email telling me the machine;-)


>> >>>>> Oct 22 02:06:02 i4 kernel: em1: discard frame w/o packet header
>> >>>>> Oct 22 02:06:10 i4 kernel: em2: discard frame w/o packet header

is if_em(4) receiving a packet and then passing it up in em_rxeof()
by calling ifp->if_input ending up in
sys/net/if_ethersubr.c:ether_input() and there's an early check on
1) that the interface is up, and 2)
  580         /*
  581          * Do consistency checks to verify assumptions
  582          * made by code past this point.
  583          */
  584         if ((m->m_flags & M_PKTHDR) == 0) {
  585                 if_printf(ifp, "discard frame w/o packet header\n");
  586                 ifp->if_ierrors++;
  587                 m_freem(m);
  588                 return;
  589         }

So whatever mbuf if_em(4) is passing up here isn't setup correctly and
it sounds like the mbuf chaining logic or receive ring handling could
have a problem, though I doubt that em_refresh_mbufs() would be at
fault ... but I'll leave that to Jack.



For the following one, I hope to know the problem for it and am
working on fixing this already.  This is most likely more new-arp
fallout and the last we currrently know of (despite some nd6 locking
problems which will need to be addressed separately and are under
discussion)

>> Fatal trap 9: general protection fault while in kernel mode
>> cpuid = 0; apic id = 00
>> instruction pointer     = 0x20:0xffffffff80740a50
>> stack pointer           = 0x28:0xffffff800005a890
>> frame pointer           = 0x28:0xffffff800005a930
>> 
>> code segment            = base 0x0, limit 0xfffff, type 0x1b
>>                        = DPL 0, pres 1, long 1, def32 0, gran 1
>> processor eflags        = interrupt enabled, resume, IOPL = 0
>> current process         = 12 (swi4: clock)
>> [thread pid 12 tid 100007 ]
>> Stopped at      in6_cksum+0x410:        movzwl  (%rsi),%r10d
>> db> bt
>> Tracing pid 12 tid 100007 td 0xffffff00025083e0
>> in6_cksum() at in6_cksum+0x410
>> icmp6_reflect() at icmp6_reflect+0x312
>> icmp6_error() at icmp6_error+0x1ec
>> nd6_llinfo_timer() at nd6_llinfo_timer+0x208
>> softclock() at softclock+0x2a6
>> intr_event_execute_handlers() at intr_event_execute_handlers+0x66
>> ithread_loop() at ithread_loop+0xb2
>> fork_exit() at fork_exit+0x12a
>> fork_trampoline() at fork_trampoline+0xe
>> --- trap 0, rip = 0, rsp = 0xffffff800005ad30, rbp = 0 ---
>> db>

Mike, as you found that in a lab setup and I was never able to
reproduce any of the reports before, I'll send you a patch over
the weekend and we'll see if you can still reproduce it after
patching.

/bz

-- 
Bjoern A. Zeeb                              Welcome a new stage of life.
         <ks> Going to jail sucks -- <bz> All my daemons like it!
   http://www.freebsd.org/doc/en_US.ISO8859-1/books/handbook/jails.html



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20101023091555.W66242>