From owner-freebsd-hackers@FreeBSD.ORG Tue Apr 28 12:22:24 2009 Return-Path: Delivered-To: freebsd-hackers@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id BA9F7106566B for ; Tue, 28 Apr 2009 12:22:24 +0000 (UTC) (envelope-from joerg@britannica.bec.de) Received: from www.sonnenberger.org (www.ostsee-abc.de [62.206.222.50]) by mx1.freebsd.org (Postfix) with ESMTP id 2AF9B8FC1F for ; Tue, 28 Apr 2009 12:22:24 +0000 (UTC) (envelope-from joerg@britannica.bec.de) Received: from britannica.bec.de (www.sonnenberger.org [192.168.1.10]) by www.sonnenberger.org (Postfix) with ESMTP id 553AC6674D for ; Tue, 28 Apr 2009 14:22:19 +0200 (CEST) Received: by britannica.bec.de (Postfix, from userid 1000) id C543F1BDD7B; Tue, 28 Apr 2009 14:22:25 +0200 (CEST) Date: Tue, 28 Apr 2009 14:22:25 +0200 From: Joerg Sonnenberger To: freebsd-hackers@freebsd.org Message-ID: <20090428122225.GA2862@britannica.bec.de> Mail-Followup-To: freebsd-hackers@freebsd.org References: <20090427183836.GA10793@zim.MIT.EDU> <49F5FE45.2090101@freebsd.org> <20090427193326.GA7654@britannica.bec.de> <20090427194904.GA11137@zim.MIT.EDU> <49F6C7A1.6070708@FreeBSD.org> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <49F6C7A1.6070708@FreeBSD.org> User-Agent: Mutt/1.5.19 (2009-01-05) Subject: Re: SoC 2009: BSD-licensed libiconv in base system X-BeenThere: freebsd-hackers@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Technical Discussions relating to FreeBSD List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 28 Apr 2009 12:22:25 -0000 On Tue, Apr 28, 2009 at 11:08:49AM +0200, Gabor Kovesdan wrote: > Citrus is based on UCS-4 as an internal encoding, just like the another > BSD-licensed iconv library. This is a barrier to support encodings that > aren't supported by UCS-4. More precisely is that Citrus can and will use UCS-4 as appropiate. It does not enforce it. Which is an important difference. One reason that it is often used is that it helps to avoid exponential growth of the translation tables. It just isn't worth the time to write ISO-8859-1 to ISO-8859-15 (trivial), if the translation to and from UCS4 gives the same result with marginally more work. > It's possible that there are little poor countries with an own writing > system but probably their writing system is unsupported because the > starvation, poorness and lack of water and electricity are more serious > problems there. I wouldn't call all both parts of Korea little poor countries and it is a wonderful example for why UCS 4 Level 1 can be problematic. Joerg