Skip site navigation (1)Skip section navigation (2)
Date:      Fri, 27 Mar 2009 17:36:10 -0500
From:      Brooks Davis <brooks@freebsd.org>
To:        Robert Watson <rwatson@freebsd.org>
Cc:        freebsd-current@freebsd.org, Prashant Vaibhav <prashant.vaibhav@gmail.com>
Subject:   Re: Improving the kernel/i386 timecounter performance (GSoC proposal)
Message-ID:  <20090327223610.GA58090@lor.one-eyed-alien.net>
In-Reply-To: <alpine.BSF.2.00.0903272217340.60642@fledge.watson.org>
References:  <11609492.9579.1238167614335.JavaMail.root@vms070.mailsrvcs.net> <49CD0405.1060704@samsco.org> <alpine.BSF.2.00.0903271821060.60642@fledge.watson.org> <17560ccf0903271348p52351481v4cc83c14037e8836@mail.gmail.com> <alpine.BSF.2.00.0903272217340.60642@fledge.watson.org>

next in thread | previous in thread | raw e-mail | index | archive | help

--oyUTqETQ0mS9luUI
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
Content-Transfer-Encoding: quoted-printable

On Fri, Mar 27, 2009 at 10:19:35PM +0000, Robert Watson wrote:
>=20
> On Sat, 28 Mar 2009, Prashant Vaibhav wrote:
>=20
>> Actually OS X is more similar than that: the shared page also contains=
=20
>> functions that can be called by user applications, though their entry=20
>> points are fixed and they're not in any particular format like elf/mach-=
o.=20
>> Userspace implementations of gettimeofday, bcopy etc. are provided in th=
e=20
>> kernel itself, which is a nice design imo as the specific version to loa=
d=20
>> is chosen by the kernel at boot time depending on processor capabilities.
>=20
> One cute thing about Linux exporting the page as ELF is that the dynamic=
=20
> linker just finds and links libc against it for the system call path.  EL=
F=20
> is a fairly straight-forward format, so it's not a bad approach, although=
=20
> it does make the kernel side more complex.  One downside, of course, is=
=20
> that it means the kernel has to export 32-bit code to 32-bit processes,=
=20
> 64-bit code to 64-bit processes, etc, if you want the higher performance=
=20
> stuff for 32-bit processes on 64-bit kernels, you have to build the expos=
ed=20
> code as non-native code.

Either way, I suspect we really want a function based interface because the=
n we
have a layer of insulation between the kernel and userspace.  Without this,
we're stuck providing any bits in the shared page forever to support old
binaries.

-- Brooks

> Robert N M Watson
> Computer Laboratory
> University of Cambridge
>=20
>=20
>>=20
>>=20
>>=20
>> On Fri, Mar 27, 2009 at 11:53 PM, Robert Watson <rwatson@freebsd.org> wr=
ote:
>>=20
>>       On Fri, 27 Mar 2009, Scott Long wrote:
>>=20
>>             I've been talking about this for years. ??All I need
>>             is help with the VM magic to create the page on
>>             fork. ??I also want two pages, one global for
>>             gettimeofday (and any other global data we can think
>>             of) and one per-process for static data like
>>             getpid/getgid.
>>=20
>>=20
>> FWIW, there are some variations in schemes across OS's -- one extreme
>> is the Linux approach, which actually exports a mini shared library in
>> ELF format on the shared page, providing implementations of various
>> services (such as entering system calls), time stuff, etc. ??Less
>> extreme are the shared pages offered on Mac OS X, etc.
>>=20
>> Robert N M Watson
>> Computer Laboratory
>> University of Cambridge
>>=20
>>=20
>>=20
>>       Scott
>>=20
>>=20
>>       Sergey Babkin wrote:
>>             ?? (Sorry for the top quoting). Probably the
>>             best implementation of
>>             ?? gettimeofd=3Dy() is to have
>>             ?? a page in the kernel mapped read-only to all
>>             the user pr=3Dcesses. Put
>>             ?? the kernel's idea of time
>>             ?? into this page. Then getting the =3Dime
>>             becomes a simple read (OK, two
>>             ?? reads, to make sure that
>>             ?? no update =3Das happened in between).
>>             ?? The TSC can then be used to add the
>>             precis=3Don between the ticks of
>>             ?? the kernel timer:
>>             ?? i.e. remember the value of TS=3D when the last
>>             tick happen, and the
>>             ?? highest rate at which
>>             ?? TSC may be ti=3Dking at this CPU, and export
>>             in the same page. This
>>             ?? would guarantee thatthe time is not moving
>>             back.
>>             ?? However there are more issues with TS=3D. TSC
>>             is guaranteed to have
>>             ?? the same value
>>             ?? on all the processors that s=3Dare the same
>>             system bus. But if the
>>             ?? machine is built of multiple
>>             ?? buses =3Dith bridges between them, all bets
>>             are off. Each bus may be
>>             ?? stopped, resta=3Dted
>>             ?? and clocked separately. There is no way to
>>             tell, on which CPU is th=3D
>>             ?? process currently
>>             ?? runnning, and it may be rescheduled do a
>>             different C=3DU right before
>>             ?? or after the RDTSC
>>             ?? instruction.
>>             ?? -SB
>>             ?? Ma=3D 26, 2009 06:55:04 PM,
>>             [1]phk@phk.freebsd.dk wrote:
>>             ?? ?? ?? ??In message
>>             <[2]17560ccf0903260551v1f5cba9eu8
>>             7727c0bae7baa3@mail.gmail.com>, Prasha
>>             ?? ?? nt Vaibhav writes:
>>             ?? ?? =3DThe gettimeofday() function's
>>             implementation will then be
>>             ?? ?? >change=3D to read the timestamp counter
>>             (TSC) from the processor,
>>             ?? ?? and use the
>>             ?? ?? &g=3D;reading in conjunction with the timing
>>             info exported by the
>>             ?? ?? kernel to
>>             ?? ?? =3Dcalculate and return the time info in
>>             proper format.
>>             ?? ?? I take it a=3D read, that you know that
>>             there are other relvant
>>             ?? ?? functions than gettim=3Dofday() and that
>>             these must provide a
>>             ?? ?? monotonic timescale when queried
>>             =3Dnterleaved ?
>>             ?? ?? Be aware that the TSC may not be, and may
>>             not stay syn=3Dhronized
>>             ?? ?? across multiple cores.
>>             ?? ?? Further more, the TSC is not con=3Dtant
>>             frequency and in particular
>>             ?? ?? not "known frequency" at all times.
>>             ?? ?? There are a lot of nasty cases to check,
>>             and a nasty interpolation
>>             ?? ?? =3Dequired, which, in my tests some years
>>             back, totally negated any
>>             ?? ?? speedu=3D from using the TSC in the first
>>             place.
>>             ?? ?? At the very minimum, you wi=3Dl have to add
>>             a quirk table where
>>             ?? ?? known good {CPU+MOBO+BIOS} combinatio=3Ds
>>             can be entered, as we
>>             ?? ?? find them.
>>             ?? ?? >This will also pave way f=3Dr optionally
>>             making the
>>             ?? ?? >FreeBSD kernel tickless,
>>             ?? ?? Rubbish. T=3Dmecounters are not even closely
>>             associated with the
>>             ?? ?? tick or ticklessnes=3D of the kernel. [1]
>>             ?? ?? > - The TSC frequency might change on
>>             cert=3Din processors with
>>             ?? ?? non-constant
>>             ?? ?? > TSC rate (because of SpeedStep, =3Dynamic
>>             freq scaling etc.). The
>>             ?? ?? only way to
>>             ?? ?? > combat this is that t=3De kernel be
>>             notified every time the
>>             ?? ?? processor
>>             ?? ?? > frequency changes.=3Dvery cpu frequency
>>             driver will need to be
>>             ?? ?? updated to
>>             ?? ?? > notify the=3Dernel before and after a cpu
>>             freq change.
>>             ?? ?? That is not good enough=3D the bios may
>>             autonomously change the cpu
>>             ?? ?? speed
>>             ?? ?? and the skew from not k=3Dowing exactly
>>             _when_ and _how_ the cpu
>>             ?? ?? clock
>>             ?? ?? changed, is a significant =3Dumber of
>>             microseconds, plenty of time
>>             ?? ?? to make strange things happen.
>>             ?? ?? You will want to study carefully Dave
>>             Mills work to tame the alpha
>>             ?? ?? =3Dhips wandering SAW clocks.
>>             ?? ?? Poul-Henning
>>             ?? ?? [1] In my mind, rewo=3Dking the callout
>>             system in the kernel would
>>             ?? ?? be a much better more neded=3Dnd much more
>>             worthwhile project.
>>             ?? ?? --
>>             ?? ?? Poul-Henning Kamp | =3DNIX since Zilog Zeus
>>             3.20
>>             ?? ?? [3]phk@FreeBSD.ORG | TCP=3DIP since RFC 956
>>             ?? ?? FreeBSD committer | BSD since 4.3-tahoe
>>             ?? ?? N=3Dver attribute to malice what can
>>             adequately be explained by
>>             ?? ??
>>             incompetence.<=3Dr>_________________________________________=
______
>>             ?? ?? [4]freebsd-hackers@freebsd.org mailing
>>             list
>>             ?? ??
>>             [5]http://lists.freebsd.org/mailman/listinfo/freebsd-hackers=
To
>>             ?? ?? unsubscribe, send any mail to "[6]fre
>>             ebsd-hackers-unsubscribe@freebsd.org"
>>=20
>>             References
>>=20
>>             ?? 1. 3D"mailto:phk@phk.freebsd.dk"
>>             ?? 2. file://localhost/tmp/3D ?? 3.
>>             3D"mailto:phk@FreeBSD.ORG"
>>             ?? 4. 3D"mailto:fre ?? 5. 3D"http://lists.=3D/
>>             ?? 6.3D"mailto:freebsd-hackers-unsub________________________=
____________________
>>             ___
>>             freebsd-current@freebsd.org mailing list
>>             http://lists.freebsd.org/mailman/listinfo/freebsd-current
>>             To unsubscribe, send any mail to
>>             "freebsd-current-unsubscribe@freebsd.org"
>>=20
>>=20
>>       _______________________________________________
>>       freebsd-current@freebsd.org mailing list
>>       http://lists.freebsd.org/mailman/listinfo/freebsd-current
>>       To unsubscribe, send any mail to
>>       "freebsd-current-unsubscribe@freebsd.org"
>>=20
>>=20
>>=20
>>=20

> _______________________________________________
> freebsd-current@freebsd.org mailing list
> http://lists.freebsd.org/mailman/listinfo/freebsd-current
> To unsubscribe, send any mail to "freebsd-current-unsubscribe@freebsd.org"


--oyUTqETQ0mS9luUI
Content-Type: application/pgp-signature
Content-Disposition: inline

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.9 (FreeBSD)

iD8DBQFJzVTZXY6L6fI4GtQRAoh1AJ9mQRsh0uYKBYCYjRq4z2qdAe+oZQCfZaTy
qqkeccekwxbFjWvg+wKGpqI=
=dQO7
-----END PGP SIGNATURE-----

--oyUTqETQ0mS9luUI--



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20090327223610.GA58090>