Skip site navigation (1)Skip section navigation (2)
Date:      Thu, 29 Oct 2015 14:33:38 +0200
From:      Konstantin Belousov <kostikbel@gmail.com>
To:        Andriy Gapon <avg@FreeBSD.org>
Cc:        Alexander Motin <mav@FreeBSD.org>, freebsd-hackers <freebsd-hackers@FreeBSD.org>, Poul-Henning Kamp <phk@phk.freebsd.dk>, Jung-uk Kim <jkim@FreeBSD.org>
Subject:   Re: instability of timekeeping
Message-ID:  <20151029123338.GU2257@kib.kiev.ua>
In-Reply-To: <5631FB66.4000007@FreeBSD.org>
References:  <56261398.60102@FreeBSD.org> <56261FE6.90302@FreeBSD.org> <56274FFC.2000608@FreeBSD.org> <20151021184850.GX2257@kib.kiev.ua> <562F3E2F.2010100@FreeBSD.org> <20151027115810.GU2257@kib.kiev.ua> <562F8109.4050203@FreeBSD.org> <20151027140403.GB2257@kib.kiev.ua> <5630FC3B.2070908@FreeBSD.org> <5631FB66.4000007@FreeBSD.org>

next in thread | previous in thread | raw e-mail | index | archive | help
On Thu, Oct 29, 2015 at 12:56:38PM +0200, Andriy Gapon wrote:
> On 28/10/2015 18:47, Andriy Gapon wrote:
> > In either case I am going to add a few more trace points in et_start and the
> > HPET timer code and see if I can catch anything interesting there.
> 
> Okay, more data:
> https://people.freebsd.org/~avg/timekeeping-ktr.2.patch
> https://people.freebsd.org/~avg/timekeeping.2.ktrdump.txt
> 
> I think that the snippet (amended with some notes of mine) makes it painfully
> obvious that the timer interrupt got very delayed when all CPUs entered the idle
> state.
> I do not see anything that could suggest a FreeBSD bug.
> 
> There is another sad discovery.  Turns out that my CPU model provides two ways
> of doing C1E magic.  The sane one: the north bridge logic in the CPU performs a
> read of a configured LVL3 register so that C3 is entered.  The insane one: the
> CPU NB performs a write of a configured value to a configured SMI register, so
> that the SMI is generated and an SMM handler does the job (probably reading from
> LVL2 or LVL3).  Looking at MSR C001_0055 I see that my BIOS has chosen the
> insane approach[*], quite unfortunately.  Bugs in the SMM code are not unheard
> of, to put it mildly, so that could be an explanation for my problem.
> 
> So, I guess I'll just disable C1E and end this investigation.
> 
> [*]
> $ cpucontrol -m 0xc0010055 /dev/cpuctl0
> 
> 
> MSR 0xc0010055: 0x00000000 0x083400b0
> 
> SmiOnCmpHalt: SMI on chip multi-processing halt.
>  - write 0x34 to port 0xb0

What if you enable C3 (in OS, and possibly BIOS), but disable C1E ?
Does the issue with jitter persist ?



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20151029123338.GU2257>