Skip site navigation (1)Skip section navigation (2)
Date:      Fri, 4 Aug 2017 19:02:38 +0200
From:      Hans Petter Selasky <hps@selasky.org>
To:        Ben RUBSON <ben.rubson@gmail.com>, FreeBSD Net <freebsd-net@freebsd.org>
Subject:   Re: mlx4en, timer irq @100%...
Message-ID:  <72b9de84-5572-3737-b274-34d2c5cdf634@selasky.org>
In-Reply-To: <BF3A3E47-A726-49FB-B83F-666EFC5C0FF1@gmail.com>
References:  <BF3A3E47-A726-49FB-B83F-666EFC5C0FF1@gmail.com>

next in thread | previous in thread | raw e-mail | index | archive | help
On 08/04/17 18:59, Ben RUBSON wrote:
> Hello,
> 
> Not sure this is the right list, but as it seems related to a mlx4en device...
> 
> # vmstat -i 1
> (...)
> interrupt                          total       rate
> cpu23:timer                         1198       1127
> 
> # top -P ALL
> (...)
> CPU 23:  0.0% user,  0.0% nice,  0.0% system,  100% interrupt,  0.0% idle
> 
> # netstat -I mlxen0 -d -w 1
>              input         mlxen0           output
>     packets  errs idrops      bytes    packets  errs      bytes colls drops
> (and not output at all, same for mlxen1 !)
> 
> # uname -sr
> FreeBSD 11.0-RELEASE-p9
> 
> So, as you can see, one of my CPUs is used at 100% by timer interrupts,
> since about 2 hours, and suddenly.
> Initiating network connections to this server is now slow.
> And what I found is that I can't use netstat on my 2 mlx4en devices anymore
> (my monitoring tool is then no more fed).
> 
> sysctl hw.mlxen0 is OK, no errors, and trafic counters grow slowly.
> 
> What should I do ?
> How to investigate on this ?
> 
> Thank you very much for your help & support,
> 

Hi,

Try "procstat -ak". It should give an idea what is going on.

What version of FreeBSD is this?

Is this a regression issue?

--HPS



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?72b9de84-5572-3737-b274-34d2c5cdf634>