From owner-freebsd-current@FreeBSD.ORG  Thu May  6 08:32:22 2004
Return-Path: <owner-freebsd-current@FreeBSD.ORG>
Delivered-To: freebsd-current@freebsd.org
Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125])
	by hub.freebsd.org (Postfix) with ESMTP id EAB1516A4CE
	for <freebsd-current@FreeBSD.org>;
	Thu,  6 May 2004 08:32:21 -0700 (PDT)
Received: from fledge.watson.org (fledge.watson.org [204.156.12.50])
	by mx1.FreeBSD.org (Postfix) with ESMTP id 3A3FC43D46
	for <freebsd-current@FreeBSD.org>;
	Thu,  6 May 2004 08:32:21 -0700 (PDT)
	(envelope-from robert@fledge.watson.org)
Received: from fledge.watson.org (localhost [127.0.0.1])
	by fledge.watson.org (8.12.11/8.12.11) with ESMTP id i46FWALo078735;
	Thu, 6 May 2004 11:32:10 -0400 (EDT)
	(envelope-from robert@fledge.watson.org)
Received: from localhost (robert@localhost)i46FW9HR078732;
	Thu, 6 May 2004 11:32:10 -0400 (EDT)
	(envelope-from robert@fledge.watson.org)
Date: Thu, 6 May 2004 11:32:09 -0400 (EDT)
From: Robert Watson <rwatson@FreeBSD.org>
X-Sender: robert@fledge.watson.org
To: Bruce Evans <bde@zeta.org.au>
In-Reply-To: <20040506180456.J19447@gamplex.bde.org>
Message-ID: <Pine.NEB.3.96L.1040506112448.64280E-100000@fledge.watson.org>
MIME-Version: 1.0
Content-Type: TEXT/PLAIN; charset=US-ASCII
cc: freebsd-current@FreeBSD.org
cc: Gerrit Nagelhout <gnagelhout@sandvine.com>
Subject: RE: 4.7 vs 5.2.1 SMP/UP bridging performance
X-BeenThere: freebsd-current@freebsd.org
X-Mailman-Version: 2.1.1
Precedence: list
List-Id: Discussions about the use of FreeBSD-current
	<freebsd-current.freebsd.org>
List-Unsubscribe: <http://lists.freebsd.org/mailman/listinfo/freebsd-current>,
	<mailto:freebsd-current-request@freebsd.org?subject=unsubscribe>
List-Archive: <http://lists.freebsd.org/pipermail/freebsd-current>
List-Post: <mailto:freebsd-current@freebsd.org>
List-Help: <mailto:freebsd-current-request@freebsd.org?subject=help>
List-Subscribe: <http://lists.freebsd.org/mailman/listinfo/freebsd-current>,
	<mailto:freebsd-current-request@freebsd.org?subject=subscribe>
X-List-Received-Date: Thu, 06 May 2004 15:32:22 -0000


On Thu, 6 May 2004, Bruce Evans wrote:

> On Wed, 5 May 2004, Robert Watson wrote:
> 
> > The reason to run with these patches is that, without them, it's not safe
> > to run without Giant over the lower half of the network stack, so all the
> > network interrupt threads are running with Giant otherwise.
> 
> What does this particular finer grained locking do for the number of
> locks.  Nothing good I fear. 

Well, it introduces more, certainly, and this is definitely a first cut
that we will want to refine substantially.  However, it offers advantages
on both UP and SMP: on UP by offering lower latency for network processing
through the opportunity to preempt earlier and not contend with Giant on
the other half of the kernel, and SMP by offering lower latency, improving
parallelism, and reducing contention.  At least, that's what it offers in
the presence of a variety of other work to improve interrupt handling,
scheduling, etc.  That said, on SMP I see a several times speedup in
network performance in network intensive activities.

> > Also, in that
> > patchset, network processing to completion runs in the ithread rather than
> > the netisr, so you get lower latency and more parallelism even for
> > bridging.
> 
> Doesn't it give less parallelism, and higher latency for other
> interrupts, but lower latency for this interrupt of course? 

Well, it depends a fair amount on your workload and configuration.  On SMP
with bridging, it should allow the interrupt threads for the two
interfaces to run on different CPUs simultaneously acting as pipelines,
subject to lock contention and other load.  In SMP IP packet forwarding
tests I ran, this actually seemed to happen fairly reasonably and offered
substantial performance benefits.

> > Another known issue is the latency from interrupt generation to interrupt
> > task execution in the interrupt thread.
> 
> Especially with the current bugs:
> - interrupt thread execution is often or always deferred until the running
>   thread gives up control or returns to userland, and all netisr and other.
>   This bug has been active since last November.  Before then the indefinite
>   deferral only happened occasionally.
> - netisr and most other SWI priorities are backwards relative to softclock's
>   priority.  This bug seems to have been active since the first version of
>   SMPng.

Well, both of these bugs would explain regressions I've seen in latency
for network I/O handling, but I hadn't yet been able to track them down to
a particular source.  One of the things I'd like to do when the first pass
at top-to-bottom Giant free operation is complete is sit down with KTR and
look at a complete trace of context switches during various types of
operations.  I suspect we'll see a lot of nits we'll need to track down
which will have a substantial performance benefit when corrected.  For
example, it looks like our CV implementation generates an extra context
switch on wakeup that we can probably eliminate.  From chatting with John,
it sounds like the first one is resolved in his work branch, but
under-tested for a merge yet. 

Robert N M Watson             FreeBSD Core Team, TrustedBSD Projects
robert@fledge.watson.org      Senior Research Scientist, McAfee Research