Date:      Sun, 24 Dec 2006 15:58:40 +0100
From:      Andre Oppermann <andre@freebsd.org>
To:        Bruce Evans <bde@zeta.org.au>
Cc:        Scott Long <scottl@samsco.org>, src-committers@freebsd.org, cvs-src@freebsd.org, cvs-all@freebsd.org, John Polstra <jdp@polstra.com>, Robert Watson <rwatson@freebsd.org>
Subject:   Re: cvs commit: src/sys/dev/bge if_bge.c
Message-ID:  <458E95A0.5040202@freebsd.org>
In-Reply-To: <20061224185620.M25303@delplex.bde.org>
References:  <XFMail.20061223102713.jdp@polstra.com> <20061223213014.U35809@fledge.watson.org> <20061224120307.P24444@delplex.bde.org> <458E10B0.5030506@samsco.org> <20061224185620.M25303@delplex.bde.org>

Bruce Evans wrote:
> On Sun, 24 Dec 2006, Scott Long wrote:
> 
>> Bruce Evans wrote:
>>> On Sat, 23 Dec 2006, Robert Watson wrote:
>>>
>>>> On Sat, 23 Dec 2006, John Polstra wrote:
>>>>
>>>>>> That said, dropping and regrabbing the driver lock in the rxeof 
>>>>>> routine of any driver is bad.  It may be safe to do, but it incurs 
>>>>>> horrible performance penalties.  It essentially allows the 
>>>>>> time-critical, high priority RX path to be constantly preempted by 
>>>>>> the lower priority if_start or if_ioctl paths.  Even without this 
>>>>>> preemption and priority inversion, you're doing an excessive 
>>>>>> number of expensive lock ops in the fast path.
>>>
>>> It's not very time-critical or high priority for bge or any other device
>>> that has a reasonably large rx ring.  With a ring size of 512 and an rx
>>> interrupt occurring not too near the end (say at the halfway point), you
>>> have 256 packet times to finish processing the interrupt.  For normal
>>> 1518-byte packets at 1Gbps, 256 packet times is about 3 ms.  bge's rx
>>> ring size is actually larger than 512 for most hardware.
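A quick back-of-the-envelope check of that 3 ms figure (my own sketch,
not anything from the driver; it ignores preamble and interframe gap,
which only add slack):

	#include <stdio.h>

	int
	main(void)
	{
		/* seconds per 1518-byte frame at 1 Gbps */
		double pkt_time = 1518.0 * 8 / 1e9;

		/* 256 packet times, in milliseconds: ~3.1 ms */
		printf("%.2f ms\n", 256 * pkt_time * 1e3);
		return (0);
	}
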
>>
>> Speed testing with full-size packets doesn't tax the hardware or the OS,
>> nor does it represent real-world scenarios.  Testing minimum-size packets
>> isn't terribly real-world either, but it represents the worst case for
>> the hardware and the OS and thus is a good standard for extrapolating
>> performance potential.  And 1Gb isn't terribly interesting these days;
>> 10Gb is.
> 
> I left calculating the timing for minimum-size packets as an exercise.
> It's also not very interesting, because there are no device/bus/CPU
> combinations that can keep up with it, even at 1Gbps, due to overhead
> issues that push the latency issues further off.  The maximum packet
> rate in theory for 1Gbps is 2700 kpps (46-byte packets).  I believe
> the maximum packet rate in practice for any Gbps NIC with a single
> 3-4GHz CPU is about 1000 kpps.  With an rx ring size of 1024 and an
> interrupt when the ring has 24 descriptors in it (the default is 10,
> which is far too small, but that is another bug), this leaves 1 ms of
> latency to spare.  3 ms was eternity, and 1 ms is still eternity.
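The arithmetic behind that 1 ms, spelled out (taking the 1000 kpps
ceiling above as given; the variable names are made up for illustration):

	#include <stdio.h>

	int
	main(void)
	{
		unsigned ring = 1024;	/* rx ring slots */
		unsigned thresh = 24;	/* descriptors queued when the interrupt fires */
		double pps = 1000e3;	/* assumed practical packet rate */

		/* slack between the interrupt and a full ring: 1.0 ms */
		printf("%.1f ms\n", (ring - thresh) / pps * 1e3);
		return (0);
	}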

You're not entirely correct here.  Ethernet frames are always at least
64 bytes, even if the IP packets inside them are smaller.  And there is
the preamble and the interframe gap as well.  All in all, the maximum
frame/packet rate on 1G ethernet is about 1488 kpps.  A year ago I was
able to receive two such streams w/o packet loss through em(4) and bge(4)
PCI-X devices on a dual Opteron 852 system.  However, I had to m_free()
the mbufs before ether_input() looked into them, because touching the
payload would have caused a full cache miss.  CPU load with a UP kernel
was between 40% and 60%.  This proves that receiving at those rates is
indeed possible, and that the ethernet chips, bus system, OS scheduling,
drivers and mbuf allocation system are capable of handling very high
loads if used properly.
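
For completeness, the wire-rate arithmetic behind that number (plain
802.3 framing, nothing driver-specific):

	#include <stdio.h>

	int
	main(void)
	{
		/* 64-byte minimum frame + 8-byte preamble/SFD + 12-byte gap */
		double wire_bits = (64 + 8 + 12) * 8.0;

		/* ~1488095 frames/s at 1 Gbps */
		printf("%.0f frames/s\n", 1e9 / wire_bits);
		return (0);
	}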

-- 
Andre



