From owner-freebsd-current@FreeBSD.ORG Sat Aug 23 11:56:28 2008
Return-Path:
Delivered-To: freebsd-current@freebsd.org
Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34])
        by hub.freebsd.org (Postfix) with ESMTP id 456D31065678
        for ; Sat, 23 Aug 2008 11:56:28 +0000 (UTC)
        (envelope-from erikt@midgard.homeip.net)
Received: from ch-smtp02.sth.basefarm.net (ch-smtp02.sth.basefarm.net [80.76.149.213])
        by mx1.freebsd.org (Postfix) with ESMTP id C610B8FC13
        for ; Sat, 23 Aug 2008 11:56:27 +0000 (UTC)
        (envelope-from erikt@midgard.homeip.net)
Received: from c83-253-7-225.bredband.comhem.se ([83.253.7.225]:64438 helo=falcon.midgard.homeip.net)
        by ch-smtp02.sth.basefarm.net with esmtp (Exim 4.68)
        (envelope-from ) id 1KWrUd-00028D-7R
        for freebsd-current@freebsd.org; Sat, 23 Aug 2008 13:41:19 +0200
Received: (qmail 2531 invoked from network); 23 Aug 2008 13:41:16 +0200
Received: from owl.midgard.homeip.net (10.1.5.7)
        by falcon.midgard.homeip.net with ESMTP; 23 Aug 2008 13:41:16 +0200
Received: (qmail 40167 invoked by uid 1001); 23 Aug 2008 13:41:16 +0200
Date: Sat, 23 Aug 2008 13:41:16 +0200
From: Erik Trulsson
To: Alexander Churanov
Message-ID: <20080823114116.GA40125@owl.midgard.homeip.net>
References: <3cb459ed0808230256g3f0f51epd9ab54047d3bd681@mail.gmail.com>
        <20080823102656.GE99951@hoeg.nl>
        <20080823110406.GB10445@saltmine.radix.net>
        <3cb459ed0808230416w701714e8p7be03e544a964e7b@mail.gmail.com>
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
In-Reply-To: <3cb459ed0808230416w701714e8p7be03e544a964e7b@mail.gmail.com>
User-Agent: Mutt/1.5.18 (2008-05-17)
X-Originating-IP: 83.253.7.225
X-Scan-Result: No virus found in message 1KWrUd-00028D-7R.
X-Scan-Signature: ch-smtp02.sth.basefarm.net 1KWrUd-00028D-7R
        adecf1ee8660f56e41a09238ce14047c
Cc: Ed Schouten, FreeBSD Current, Thomas Dickey
Subject: Re: Unicode-based FreeBSD
X-BeenThere: freebsd-current@freebsd.org
X-Mailman-Version: 2.1.5
Precedence: list
List-Id: Discussions about the use of FreeBSD-current
X-List-Received-Date: Sat, 23 Aug 2008 11:56:28 -0000

On Sat, Aug 23, 2008 at 03:16:43PM +0400, Alexander Churanov wrote:
> Thomas,
>
> 2008/8/23, Thomas Dickey:
> >
> > ...but it does help if the terminal can display the result.
> >
> > ...before IUTF8, there was some consensus for a few years that it was
> > up to the application to do proper backspacing. (ncurses does this anyway,
> > but apparently shell interpreters such as bash need extra assistance).
> >
>
> In brief my idea is that system and applications use UTF-8 and syscons just
> maps UTF-8 to 256 (or whatever) characters it is actually able to display.
> This is very similar to current screenmaps, but new maps will just describe
> what 256 characters of the whole unicode range can be actually displayed.
> Or, in other words, always map from UTF-8.

There are many applications that do not yet support UTF-8.
It would be bad if applications that just output 8-bit characters
"as-is" were broken.
If an application were to output characters from (e.g.) ISO-8859-1 and
syscons were to interpret them as UTF-8 it would not be pretty.

>
> Then, since syscons is going to be unicode-aware, it can do proper
> backspacing if it is given a sequence of 4 bytes where first 3 describe a
> single code point and fourth is a backslash. In my opinion, this solution
> would mostly keep current applications running correctly and introduce the
> ability to use UTF-8 IO.

I suspect it would actually break many current applications.

-- 
Erik Trulsson
ertr1013@student.uu.se
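
The encoding mismatch described in the message can be illustrated with a
minimal sketch. It is written in Python purely for illustration; the helper
name decode_as_utf8 and the sample word are assumptions made for this example,
and nothing here reflects actual syscons code. It shows what happens when
8-bit ISO-8859-1 output is read as UTF-8, and the reverse case.

```python
# Hypothetical illustration only; this is not syscons code.

def decode_as_utf8(raw: bytes) -> str:
    """Decode bytes as UTF-8, substituting U+FFFD for invalid sequences."""
    return raw.decode("utf-8", errors="replace")

word = "caf\u00e9"  # "cafe" with an acute accent on the final letter

# An application that is not UTF-8 aware writes one byte per character.
latin1_bytes = word.encode("iso-8859-1")  # b'caf\xe9'

# A UTF-8 aware application writes two bytes for the accented letter.
utf8_bytes = word.encode("utf-8")  # b'caf\xc3\xa9'

# A consumer that assumes UTF-8 sees 0xE9 as the start of a multi-byte
# sequence that never completes, so the character is lost.
as_utf8 = decode_as_utf8(latin1_bytes)

# The reverse mistake: UTF-8 bytes read as ISO-8859-1 turn one character
# into two unrelated ones.
as_latin1 = utf8_bytes.decode("iso-8859-1")

print(repr(as_utf8))
print(repr(as_latin1))
```

Either direction damages the text, which is why a console that always assumes
UTF-8 input could affect programs that emit 8-bit characters unchanged.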