Skip site navigation (1)Skip section navigation (2)
Date:      Sun, 18 Oct 2020 18:23:09 +0100
From:      Steve O'Hara-Smith <steve@sohara.org>
To:        "John Levine" <johnl@iecc.com>
Cc:        freebsd-questions@freebsd.org, naddy@mips.inka.de
Subject:   Re: printf(1) and UTF-8 multi-byte chars
Message-ID:  <20201018182309.490ff752536eae2092533c5a@sohara.org>
In-Reply-To: <20201018154838.49CBC239CEDF@ary.qy>
References:  <slrnroo8n9.1iu4.naddy@lorvorc.mips.inka.de> <20201018154838.49CBC239CEDF@ary.qy>

Next in thread | Previous in thread | Raw E-Mail | Index | Archive | Help
On 18 Oct 2020 11:48:37 -0400
"John Levine" <johnl@iecc.com> wrote:

> I don't think there is any useful middle ground between counting bytes
> and full Unicode typesetting.

	There are good reasons for using all three levels, here are some:

Bytes: Content length headers, malloc calls - storage related
Glyphs: Truncation, apparent length, sorting - appearance related
Unicode Characters: UTF-8/16/32 conversions - encoding related

-- 
Steve O'Hara-Smith <steve@sohara.org>



Want to link to this message? Use this URL: <http://docs.FreeBSD.org/cgi/mid.cgi?20201018182309.490ff752536eae2092533c5a>