From owner-freebsd-net@FreeBSD.ORG Mon Apr 4 16:58:50 2011 Return-Path: Delivered-To: freebsd-net@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 753AB106566C for ; Mon, 4 Apr 2011 16:58:50 +0000 (UTC) (envelope-from jfvogel@gmail.com) Received: from mail-vx0-f182.google.com (mail-vx0-f182.google.com [209.85.220.182]) by mx1.freebsd.org (Postfix) with ESMTP id 1F0278FC1B for ; Mon, 4 Apr 2011 16:58:49 +0000 (UTC) Received: by vxc34 with SMTP id 34so5289045vxc.13 for ; Mon, 04 Apr 2011 09:58:49 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:in-reply-to:references:date :message-id:subject:from:to:cc:content-type; bh=5aMpeGhD7anxJ0zl6++Vc1YgrSd21zRc3+esOGZ1g0k=; b=fW3nfjKzwWEnoaFnCC3f9RrzN/ium5zMrzck56DjT1/6LQ1Pi1Nu1NH7ORyLqWqnxx z3BFLZSosHWtH+8FzUI1l9R7/4WAJRH359cXFoXJ+qrYDIE8NkGstypHMIHAp2speDDP 0YCMgw+G8AKc8zP+4awkbx7HMLTXqvsFdjY8E= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :cc:content-type; b=ouccCXMZkLyZyQo7Sqt8RdbrfB2y8yK6pK6oG4kqWTDn3HGjkfjSFH56dQBvuNqQjB 3iMGlncPqp1MVXveIlGKQ4IvNbn0ghOFqB0gxH+wwnaB0mTYfMhe/dJaSmS/icp0ERQv vNX5NIPlyIsw+9hqlzGLaO/85UTNFpt6Dlpwg= MIME-Version: 1.0 Received: by 10.52.92.161 with SMTP id cn1mr10102580vdb.253.1301936328825; Mon, 04 Apr 2011 09:58:48 -0700 (PDT) Received: by 10.52.167.6 with HTTP; Mon, 4 Apr 2011 09:58:48 -0700 (PDT) In-Reply-To: References: Date: Mon, 4 Apr 2011 09:58:48 -0700 Message-ID: From: Jack Vogel To: Arnaud Lacombe Content-Type: text/plain; charset=ISO-8859-1 X-Content-Filtered-By: Mailman/MimeDel 2.1.5 Cc: freebsd-net@freebsd.org Subject: Re: em(4) hang [Was: Re: igb(4) won't start with "igb0: Could not setup receive structures"] X-BeenThere: freebsd-net@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Networking and TCP/IP with FreeBSD List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 04 Apr 2011 16:58:50 -0000 Thanks for the good news, this change will need to go into the other drivers also. Jack On Mon, Apr 4, 2011 at 9:11 AM, Arnaud Lacombe wrote: > Hi Jack, > > On Fri, Apr 1, 2011 at 2:55 PM, Jack Vogel wrote: > > Arnaud, > > > > Please try the code change I just checked into HEAD, it should finally > > resolve any hang that is due to mbufs not being refreshed. That's not > > to say there may not be other reasons out there but I'm keeping my > > fingers crossed that this is behind at least some of the hangs. > > > Thanks! > > I backported these latest changes to 7-stable on Friday and re-ran > tests. The box, and em0, has been fine for 3 days with the same test > pattern that made it hang after a few hour previously. > > Some stats: > # sysctl dev.em.0 > dev.em.0.%desc: Intel(R) PRO/1000 Network Connection 7.2.3 > dev.em.0.%driver: em > dev.em.0.%location: slot=0 function=0 > dev.em.0.%pnpinfo: vendor=0x8086 device=0x10d3 subvendor=0x8086 > subdevice=0x0000 class=0x020000 > dev.em.0.%parent: pci1 > dev.em.0.nvm: -1 > dev.em.0.debug: -1 > dev.em.0.rx_int_delay: 0 > dev.em.0.tx_int_delay: 66 > dev.em.0.rx_abs_int_delay: 66 > dev.em.0.tx_abs_int_delay: 66 > dev.em.0.rx_processing_limit: 100 > dev.em.0.flow_control: 3 > dev.em.0.eee_control: 0 > dev.em.0.link_irq: 20516 > dev.em.0.mbuf_alloc_fail: 0 > dev.em.0.cluster_alloc_fail: 0 > dev.em.0.dropped: 0 > dev.em.0.tx_dma_fail: 0 > dev.em.0.rx_overruns: 0 > dev.em.0.watchdog_timeouts: 0 > dev.em.0.device_control: 1477444168 > dev.em.0.rx_control: 67141634 > dev.em.0.fc_high_water: 18432 > dev.em.0.fc_low_water: 16932 > dev.em.0.queue0.txd_head: 175 > dev.em.0.queue0.txd_tail: 175 > dev.em.0.queue0.tx_irq: 433666694 > dev.em.0.queue0.no_desc_avail: 0 > dev.em.0.queue0.rxd_head: 654 > dev.em.0.queue0.rxd_tail: 653 > dev.em.0.queue0.rx_irq: 280606824 > dev.em.0.mac_stats.excess_coll: 0 > dev.em.0.mac_stats.single_coll: 0 > dev.em.0.mac_stats.multiple_coll: 0 > dev.em.0.mac_stats.late_coll: 0 > dev.em.0.mac_stats.collision_count: 0 > dev.em.0.mac_stats.symbol_errors: 0 > dev.em.0.mac_stats.sequence_errors: 0 > dev.em.0.mac_stats.defer_count: 0 > dev.em.0.mac_stats.missed_packets: 61034 > dev.em.0.mac_stats.recv_no_buff: 1974 > dev.em.0.mac_stats.recv_undersize: 0 > dev.em.0.mac_stats.recv_fragmented: 0 > dev.em.0.mac_stats.recv_oversize: 0 > dev.em.0.mac_stats.recv_jabber: 0 > dev.em.0.mac_stats.recv_errs: 0 > dev.em.0.mac_stats.crc_errs: 0 > dev.em.0.mac_stats.alignment_errs: 0 > dev.em.0.mac_stats.coll_ext_errs: 0 > dev.em.0.mac_stats.xon_recvd: 0 > dev.em.0.mac_stats.xon_txd: 21 > dev.em.0.mac_stats.xoff_recvd: 0 > dev.em.0.mac_stats.xoff_txd: 57286 > dev.em.0.mac_stats.total_pkts_recvd: 541037465 > dev.em.0.mac_stats.good_pkts_recvd: 540976431 > dev.em.0.mac_stats.bcast_pkts_recvd: 216 > dev.em.0.mac_stats.mcast_pkts_recvd: 0 > dev.em.0.mac_stats.rx_frames_64: 527887692 > dev.em.0.mac_stats.rx_frames_65_127: 13088739 > dev.em.0.mac_stats.rx_frames_128_255: 0 > dev.em.0.mac_stats.rx_frames_256_511: 0 > dev.em.0.mac_stats.rx_frames_512_1023: 0 > dev.em.0.mac_stats.rx_frames_1024_1522: 0 > dev.em.0.mac_stats.good_octets_recvd: 34785499386 > dev.em.0.mac_stats.good_octets_txd: 909161087533 > dev.em.0.mac_stats.total_pkts_txd: 761197293 > dev.em.0.mac_stats.good_pkts_txd: 761139986 > dev.em.0.mac_stats.bcast_pkts_txd: 1 > dev.em.0.mac_stats.mcast_pkts_txd: 0 > dev.em.0.mac_stats.tx_frames_64: 42029962 > dev.em.0.mac_stats.tx_frames_65_127: 40597084 > dev.em.0.mac_stats.tx_frames_128_255: 19591094 > dev.em.0.mac_stats.tx_frames_256_511: 31040789 > dev.em.0.mac_stats.tx_frames_512_1023: 69157381 > dev.em.0.mac_stats.tx_frames_1024_1522: 558723676 > dev.em.0.mac_stats.tso_txd: 0 > dev.em.0.mac_stats.tso_ctx_fail: 0 > dev.em.0.interrupts.asserts: 16073 > dev.em.0.interrupts.rx_pkt_timer: 0 > dev.em.0.interrupts.rx_abs_timer: 0 > dev.em.0.interrupts.tx_pkt_timer: 0 > dev.em.0.interrupts.tx_abs_timer: 1 > dev.em.0.interrupts.tx_queue_empty: 0 > dev.em.0.interrupts.tx_queue_min_thresh: 0 > dev.em.0.interrupts.rx_desc_min_thresh: 0 > dev.em.0.interrupts.rx_overrun: 3 > > # netstat -m > 33905/48040/81945 mbufs in use (current/cache/total) > 30335/2433/32768/32768 mbuf clusters in use (current/cache/total/max) > 2781/348 mbuf+clusters out of packet secondary zone in use (current/cache) > 3194/9607/12801/12800 4k (page size) jumbo clusters in use > (current/cache/total/max) > 0/0/0/6400 9k jumbo clusters in use (current/cache/total/max) > 0/0/0/3200 16k jumbo clusters in use (current/cache/total/max) > 83770K/55304K/139074K bytes allocated to network (current/cache/total) > 0/31564/0 requests for mbufs denied (mbufs/clusters/mbuf+clusters) > 0/0/0 requests for jumbo clusters denied (4k/9k/16k) > 0/7/6656 sfbufs in use (current/peak/max) > 0 requests for sfbufs denied > 0 requests for sfbufs delayed > 0 requests for I/O initiated by sendfile > 0 calls to protocol drain routines > > - Arnaud >