From owner-freebsd-current@FreeBSD.ORG  Fri Nov 28 04:07:18 2003
Return-Path: <owner-freebsd-current@FreeBSD.ORG>
Delivered-To: freebsd-current@freebsd.org
Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125])
	by hub.freebsd.org (Postfix) with ESMTP
	id 3448D16A4CE; Fri, 28 Nov 2003 04:07:18 -0800 (PST)
Received: from stork.mail.pas.earthlink.net (stork.mail.pas.earthlink.net
	[207.217.120.188])	by mx1.FreeBSD.org (Postfix) with ESMTP
	id D112F43FDF; Fri, 28 Nov 2003 04:07:16 -0800 (PST)
	(envelope-from tlambert2@mindspring.com)
Received: from user-2ivfjun.dialup.mindspring.com ([165.247.207.215]
	helo=mindspring.com)
	by stork.mail.pas.earthlink.net with asmtp (SSLv3:RC4-MD5:128)
	(Exim 3.33 #1)	id 1APhNq-0002nN-00; Fri, 28 Nov 2003 04:06:03 -0800
Message-ID: <3FC739E7.317E34A0@mindspring.com>
Date: Fri, 28 Nov 2003 04:04:55 -0800
From: Terry Lambert <tlambert2@mindspring.com>
X-Mailer: Mozilla 4.79 [en] (Win98; U)
X-Accept-Language: en
MIME-Version: 1.0
To: Robert Watson <rwatson@freebsd.org>
References: <Pine.NEB.3.96L.1031127101515.30532A-100000@fledge.watson.org>
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit
X-ELNK-Trace: b1a02af9316fbb217a47c185c03b154d40683398e744b8a4d61e2828877e5f32d10b2acb45641acd93caf27dac41a8fd350badd9bab72f9c350badd9bab72f9c
cc: freebsd-current@freebsd.org
cc: Poul-Henning Kamp <phk@phk.freebsd.dk>
Subject: Re: Apples linking
X-BeenThere: freebsd-current@freebsd.org
X-Mailman-Version: 2.1.1
Precedence: list
List-Id: Discussions about the use of FreeBSD-current
	<freebsd-current.freebsd.org>
List-Unsubscribe: <http://lists.freebsd.org/mailman/listinfo/freebsd-current>,
	<mailto:freebsd-current-request@freebsd.org?subject=unsubscribe>
List-Archive: <http://lists.freebsd.org/pipermail/freebsd-current>
List-Post: <mailto:freebsd-current@freebsd.org>
List-Help: <mailto:freebsd-current-request@freebsd.org?subject=help>
List-Subscribe: <http://lists.freebsd.org/mailman/listinfo/freebsd-current>,
	<mailto:freebsd-current-request@freebsd.org?subject=subscribe>
X-List-Received-Date: Fri, 28 Nov 2003 12:07:18 -0000

Robert Watson wrote:
> On Thu, 27 Nov 2003, David Rhodus wrote:
> > what are some of the changes that Apple made to have everything
> > dynamically linked in darwin ? Has anyone done timed runs lately on
> > dynamically vers. static linking on darwin ? Or did they find just
> > cleaning up the dlopen code path seems to be enough to pull dynamically
> > linking everything ?
> 
> In Darwin, all binaries but /sbin/init are dynamically linked.

There are actually a couple more than that.  I specifically did an
audit of which programs on a fully installed binary Panther system
were statically vs. dynamically linked.  8-).

> There are
> a number of interesting optimizations, including prebinding.  The most
> interesting variation on the them is a series of system calls that manage
> a special "shared region" for prebound libraries.  These calls are:
> 
> int     load_shared_file();
> int     reset_shared_file();
> int     new_system_shared_region();

Correct.  You should also read this as "never use these calls in
your own code".

> My
> understanding is that they are always mapped into processes at the same
> address, so a prebound version of the library can be used across many
> applications.

Yes.  Their location is arrived at dynamically, and fixed statically;
this has the same advantage as System V shared libraries, but doesn't
burn chunks of address space for different versions of the same thing,
as time goes forward.  In particular, the libraries can be moved around,
but when they are, the code that depends on them is re-prebound to know
about the new location.

> If a library isn't found in the prebinding cache,
> the application does the work itself, but probably sends a message off.

It does.  This message triggers a prebinding.  Normally, this is
done automatically at the time the applicaiton is installed, but
there are circumstances where it needs to be triggered lazily, and
this handles that case.


> During system/application install, I believe OS X kicks off a tool to see
> if its current prebinding cache/layout/etc is optimal for the set of
> applications, and adjusts the cache as needed.

If you use the standard installation tools ("ViSE"), yes.


> I don't have access to any of the performance measurements -- perhaps
> Terry does -- but this approach has a number of important benefits.

I do, but the information is proprietary, for obvious reasons.

> In
> particular, it addresses the following issues:
[ ... ]

(4) It deals with the issue of needing fixup tables for shared
    libraries to get the right address, so it saves the fixup
    and the indirect that would be otherwise needed, as well.

> The one thing that
> turns me off to this scheme is that I'd like it if we could find a way to
> represent this using solely existing BSD/UNIX kernel primitives (mmap, et
> al) and userspace, rather than adding special-purpose system calls that
> complicated various code paths, and that aren't portable.

It would b relatively trivial to add two flags, on to mmap(), and
one to madvise().  The mmap() flag would indicate a global mapping
into all process address spaces (and select a mapping in a reserved
memory range for that purpose).  The madvise() flag would indicate
that the prebind region was being recreated for use by subsequently
started processes.


> As I mentioned previously in this thread, it could be we could witness a
> lot of the benefits of this approach by simply using heuristics to
> increase the likelihood of libraries getting mapped to the same address in
> different processes, increasing the effectiveness (and reducing the size)
> of the prebinding cache.

This is actually unlikely.  Minimally, you would need to ensure
against collisions, which is hard nough, but then you must also
avoid collisions with shared memory/mmap/dlopen objects, as well.
For this to work, a reserved address region is required.  And for
that, you would ned to take it from the KVA by splitting out the
address spaces so that UVA = 4G, KVA = 4G instead of UVA + KVA = 4G,
as it is today.

This is something FreeBSD should consider anyway: a reserved space
could be very useful, if only to represent the proc struct as R/O
in user space to avoid all system call overhead for getpid/getppid/etc.
system calls.  It would also be useful to avoid reducing UVA to 2G or
1G (depending on intended memory usage patterns) when you have 4G of
physical RAM installed in a machine.  It would also give enough room
so that the kmem_map would not need careful tuning to avoid mmory
exhaustion panics.

-- Terry