From owner-freebsd-stable@FreeBSD.ORG Tue Sep 7 23:33:01 2010 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 5F6EC10656EB for ; Tue, 7 Sep 2010 23:33:01 +0000 (UTC) (envelope-from mahlon@martini.nu) Received: from martini.nu (martini.nu [198.145.180.83]) by mx1.freebsd.org (Postfix) with SMTP id 3E7E08FC0A for ; Tue, 7 Sep 2010 23:32:57 +0000 (UTC) Received: (qmail 12931 invoked by uid 1000); 7 Sep 2010 23:32:57 -0000 Date: Tue, 7 Sep 2010 16:32:57 -0700 From: "Mahlon E. Smith" To: Jeremy Chadwick Message-ID: <20100907233257.GA94092@martini.nu> Mail-Followup-To: "Mahlon E. Smith" , Jeremy Chadwick , freebsd-stable@freebsd.org, Yong-Hyeon PYUN References: <20100907210813.GI49065@martini.nu> <20100907222403.GA18595@icarus.home.lan> MIME-Version: 1.0 Content-Type: multipart/signed; micalg=pgp-sha1; protocol="application/pgp-signature"; boundary="d6Gm4EdcadzBjdND" Content-Disposition: inline In-Reply-To: <20100907222403.GA18595@icarus.home.lan> X-GPG-Fingerprint: 19B8 DDB3 0156 3A03 FA80 8278 C0BE 6BFB 3606 B267 X-Sysinfo: FreeBSD 7.0-RELEASE-p1 i386 User-Agent: Mutt/1.5.17 (2007-11-01) Cc: Yong-Hyeon PYUN , freebsd-stable@freebsd.org Subject: Re: Network memory allocation failures X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 07 Sep 2010 23:33:01 -0000 --d6Gm4EdcadzBjdND Content-Type: text/plain; charset=us-ascii Content-Disposition: inline Content-Transfer-Encoding: quoted-printable On Tue, Sep 07, 2010, Jeremy Chadwick wrote: >=20 > This could be a bce(4) bug, meaning the "failed to allocate memory" > message could be indicating DMA failure or something else from the card, > and not necessarily related to mbufs. >=20 > There are also changes/fixes to bce(4) that are in RELENG_8 (8.1-STABLE) > that aren't in 8.1-RELEASE, but I don't know if those are responsible > for your problem. Hmm, well -- I'm definitely not opposed to jumping to -STABLE if it might fix it. > Please provide output from the following: >=20 > * uname -a (if desired, XXX out hostname) FreeBSD jessage 8.1-RELEASE FreeBSD 8.1-RELEASE #2: Fri Aug 20 14:30:31 PDT= 2010 root@jessage:/usr/src/sys/amd64/compile/R810 amd64 Custom kernel, with additions to GENERIC (nothing removed): device carp device snp options HZ=3D1000 options DEVICE_POLLING options ALTQ options ALTQ_CBQ options ALTQ_PRIQ options SC_DISABLE_REBOOT options PANIC_REBOOT_WAIT_TIME=3D5 ALTQ and friends not actually active on the machine. I was fighting a different battle when running GENERIC, so I can't honestly recall if this problem existed then -- I'll make sure it is still happening under GENERIC for a baseline, to eliminate any potential weirdness with DEVICE_POLLING or the HZ timing. > * vmstat -i interrupt total rate irq19: ehci0 1547103 0 irq21: uhci1 uhci3+ 29 0 irq23: atapci0 35 0 irq32: mfi0 68104468 43 cpu0: timer 3093305346 1986 irq256: bce0 46587008 29 cpu19: timer 3103614834 1992 cpu1: timer 3093298527 1986 cpu4: timer 3093297557 1986 cpu10: timer 3089824707 1983 cpu12: timer 3097896788 1989 cpu16: timer 3097897232 1989 cpu22: timer 3103615267 1992 cpu2: timer 3093297601 1986 cpu5: timer 3093298349 1986 cpu3: timer 3093298637 1986 cpu6: timer 3089823402 1983 cpu18: timer 3103614571 1992 cpu13: timer 3097897961 1989 cpu20: timer 3103615299 1992 cpu23: timer 3103614783 1992 cpu9: timer 3089821582 1983 cpu17: timer 3097898138 1989 cpu11: timer 3089821712 1983 cpu14: timer 3097897190 1989 cpu7: timer 3089821360 1983 cpu21: timer 3103615012 1992 cpu15: timer 3097898081 1989 cpu8: timer 3089824487 1983 Total 74424047066 47788 > * ifconfig -a (if desired, XXX out IPs and MACs) bce0: flags=3D8943 metr= ic 0 mtu 1500 options=3Dc01bb ether 00:25:64:fd:0b:24 inet 10.5.2.69 netmask 0xfffffc00 broadcast 10.5.3.255 media: Ethernet autoselect (1000baseT ) status: active bce1: flags=3D8802 metric 0 mtu 1500 options=3Dc01bb ether 00:25:64:fd:0b:26 media: Ethernet autoselect (none) status: no carrier bce2: flags=3D8802 metric 0 mtu 1500 options=3Dc01bb ether 00:25:64:fd:0b:28 media: Ethernet autoselect (none) status: no carrier bce3: flags=3D8802 metric 0 mtu 1500 options=3Dc01bb ether 00:25:64:fd:0b:2a media: Ethernet autoselect (none) status: no carrier lo0: flags=3D8049 metric 0 mtu 16384 options=3D3 inet6 fe80::1%lo0 prefixlen 64 scopeid 0x5=20 inet6 ::1 prefixlen 128=20 inet 127.0.0.1 netmask 0xff000000=20 nd6 options=3D3 vboxnet0: flags=3D8802 metric 0 mtu 1500 ether 0a:00:27:00:00:00 > * netstat -inbd (if desired, XXX out MACs) Name Mtu Network Address Ipkts Ierrs Idrop Ib= ytes Opkts Oerrs Obytes Coll Drop bce0 1500 00:25:64:fd:0b:24 14467627 0 0 634654= 9588 11846499 0 4646920777 0 0=20 bce0 1500 10.5.0.0/22 10.5.2.69 1987644 - - 37163= 5478 415087 - 74168123 - -=20 bce1* 1500 00:25:64:fd:0b:26 0 0 0 = 0 0 0 0 0 0=20 bce2* 1500 00:25:64:fd:0b:28 0 0 0 = 0 0 0 0 0 0=20 bce3* 1500 00:25:64:fd:0b:2a 0 0 0 = 0 0 0 0 0 0=20 lo0 16384 25561 0 0 4733= 8756 25561 0 47338756 0 0=20 lo0 16384 fe80:5::1/64 fe80:5::1 0 - - = 0 0 - 0 - -=20 lo0 16384 ::1/128 ::1 0 - - = 0 0 - 0 - -=20 lo0 16384 127.0.0.0/8 127.0.0.1 25561 - - 4733= 8756 25561 - 47338756 - -=20 vboxn 1500 0a:00:27:00:00:00 0 0 0 = 0 0 0 0 0 0=20 > * pciconf -lvc (only the bceX entry please) bce0@pci0:1:0:0: class=3D0x020000 card=3D0x02d41028 chip=3D0x163= 914e4 rev=3D0x20 hdr=3D0x00 vendor =3D 'Broadcom Corporation' device =3D 'NetXtreme II Gigabit Ethernet (BCM5709)' class =3D network subclass =3D ethernet cap 01[48] =3D powerspec 3 supports D0 D3 current D0 cap 03[50] =3D VPD cap 05[58] =3D MSI supports 16 messages, 64 bit enabled with 1 mess= age cap 11[a0] =3D MSI-X supports 9 messages in map 0x10 cap 10[ac] =3D PCI-Express 2 endpoint max data 256(512) link x2(x4) =20 > Also check dmesg to see if there's any error messages that correlate > when the problem occurs. All quiet on that front. Thanks for the reply, Jeremy! -- Mahlon E. Smith =20 http://www.martini.nu/contact.html --d6Gm4EdcadzBjdND Content-Type: application/pgp-signature Content-Disposition: inline -----BEGIN PGP SIGNATURE----- iD8DBQFMhsup1bsjBDapbeMRAlMvAJ9WcgJ8TVjEOo5I6hdHUU4ZtNqKwQCgtz9r /v/BhP7d05P1DeLSpnwpHZI= =Wi9i -----END PGP SIGNATURE----- --d6Gm4EdcadzBjdND--