From owner-freebsd-pf@FreeBSD.ORG  Tue May  3 10:18:02 2011
Return-Path: <owner-freebsd-pf@FreeBSD.ORG>
Delivered-To: freebsd-pf@freebsd.org
Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34])
	by hub.freebsd.org (Postfix) with ESMTP id B172B1065677;
	Tue,  3 May 2011 10:18:02 +0000 (UTC) (envelope-from dudu@dudu.ro)
Received: from mail-qy0-f182.google.com (mail-qy0-f182.google.com
	[209.85.216.182])
	by mx1.freebsd.org (Postfix) with ESMTP id 5322C8FC0A;
	Tue,  3 May 2011 10:18:01 +0000 (UTC)
Received: by qyk27 with SMTP id 27so3982165qyk.13
	for <multiple recipients>; Tue, 03 May 2011 03:18:01 -0700 (PDT)
Received: by 10.229.29.20 with SMTP id o20mr7031652qcc.181.1304417881210; Tue,
	03 May 2011 03:18:01 -0700 (PDT)
MIME-Version: 1.0
Received: by 10.229.105.18 with HTTP; Tue, 3 May 2011 03:17:21 -0700 (PDT)
In-Reply-To: <BANLkTi=DB7xw57LPCyUDzzeGYqX=j6Ju4w@mail.gmail.com>
References: <20110503015854.GA31444@icarus.home.lan>
	<20110503084800.GB9657@insomnia.benzedrine.cx>
	<20110503091619.GA39329@icarus.home.lan> <4DBFCB8D.10105@unsane.co.uk>
	<BANLkTi=DB7xw57LPCyUDzzeGYqX=j6Ju4w@mail.gmail.com>
From: Vlad Galu <dudu@dudu.ro>
Date: Tue, 3 May 2011 12:17:21 +0200
Message-ID: <BANLkTim9o0dCef_BVT29YviSiR0oXyk6VQ@mail.gmail.com>
To: Vincent Hoffman <vince@unsane.co.uk>
Content-Type: text/plain; charset=ISO-8859-1
X-Content-Filtered-By: Mailman/MimeDel 2.1.5
Cc: freebsd-stable@freebsd.org, Jeremy Chadwick <freebsd@jdc.parodius.com>,
	freebsd-pf@freebsd.org
Subject: Re: RELENG_8 pf stack issue (state count spiraling out of control)
X-BeenThere: freebsd-pf@freebsd.org
X-Mailman-Version: 2.1.5
Precedence: list
List-Id: "Technical discussion and general questions about packet filter
	\(pf\)" <freebsd-pf.freebsd.org>
List-Unsubscribe: <http://lists.freebsd.org/mailman/listinfo/freebsd-pf>,
	<mailto:freebsd-pf-request@freebsd.org?subject=unsubscribe>
List-Archive: <http://lists.freebsd.org/pipermail/freebsd-pf>
List-Post: <mailto:freebsd-pf@freebsd.org>
List-Help: <mailto:freebsd-pf-request@freebsd.org?subject=help>
List-Subscribe: <http://lists.freebsd.org/mailman/listinfo/freebsd-pf>,
	<mailto:freebsd-pf-request@freebsd.org?subject=subscribe>
X-List-Received-Date: Tue, 03 May 2011 10:18:02 -0000

On Tue, May 3, 2011 at 12:12 PM, Vlad Galu <dudu@dudu.ro> wrote:

>
>
> On Tue, May 3, 2011 at 11:31 AM, Vincent Hoffman <vince@unsane.co.uk>wrote:
>
>> On 03/05/2011 10:16, Jeremy Chadwick wrote:
>>
>> <snip lots of data relevant to the discussion but not my answer>
>> > Sadly I don't see a way with bsnmpd(8) to monitor things like interrupt
>> > usage, etc. otherwise I'd be graphing that.  The more monitoring the
>> > better; at least then I could say "wow, interrupts really did shoot
>> > through the roof -- the box went crazy!" and RMA the thing.  :-)
>> >
>> you could use net-mgmt/bsnmp-regex although I dont know what the
>> overhead for that is like.
>>
>
> I use munin for graphing, as it allows easy scripting without using SNMP.
>
> My case is a bit different from Jeremy's. Every once in a while there is a
> sudden traffic spike which impacts pf performance as well. However, the
> graphed figures are nowhere near what I'd consider alarming levels (this box
> has withstood more in the past). I was able to coincidentally log in after
> such a spike and noticed the pfpurge thread eating up about 30% of the CPU
> while using the normal optimization policy. In my case, it could be related
> to another issue I'm seeing on this box - mbuma allocation failures. Here
> are my graphs:
>
> http://dl.dropbox.com/u/14650083/PF/bge_bits_1-week.png
> http://dl.dropbox.com/u/14650083/PF/bge_packets_1-week.png
> http://dl.dropbox.com/u/14650083/PF/bge_stats_1-week.png
> http://dl.dropbox.com/u/14650083/PF/load-week.png
> http://dl.dropbox.com/u/14650083/PF/mbuf_errors-week.png
> http://dl.dropbox.com/u/14650083/PF/mbuf_usage-week.png
> http://dl.dropbox.com/u/14650083/PF/pf_inserts-week.png
> http://dl.dropbox.com/u/14650083/PF/pf_matches-week.png
> http://dl.dropbox.com/u/14650083/PF/pf_removals-week.png
> http://dl.dropbox.com/u/14650083/PF/pf_searches-week.png
> http://dl.dropbox.com/u/14650083/PF/pf_src_limit-week.png
> http://dl.dropbox.com/u/14650083/PF/pf_states-week.png
> http://dl.dropbox.com/u/14650083/PF/pf_synproxy-week.png
>
> I'll wait for the next time the symptom occurs to switch to a stateless
> configuration.
>
>
I forgot to mention this is a UP box using TSC for timekeeping and running
ntpd.

-- /boot/loader.conf --
hint.p4tcc.0.disabled="1"
hint.acpi_throttle.0.disabled="1"
debug.acpi.disabled="timer"
-- /boot/loader.conf --

-- sysctl output --
kern.timecounter.choice: TSC(800) i8254(0) dummy(-1000000)
kern.timecounter.hardware: TSC
-- sysctl output --


-- 
Good, fast & cheap. Pick any two.