Skip site navigation (1)Skip section navigation (2)
Date:      Sun, 22 Apr 2012 14:56:14 +0200
From:      =?ISO-8859-1?Q?Erik_N=F8rgaard?= <norgaard@locolomo.org>
To:        freebsd-questions@freebsd.org
Subject:   Re: converting UTF-8 to HTML
Message-ID:  <4F93FFEE.4040905@locolomo.org>
In-Reply-To: <20120422130642.cb5b09c2.freebsd@edvax.de>
References:  <20120421055823.GA6788@tinyCurrent> <4F9253D7.7010609@locolomo.org> <4F9278A2.1020301@locolomo.org> <alpine.BSF.2.00.1204210909450.5338@abbf.6qbyyneqvnyhc.pbz> <4F93CC95.5050209@locolomo.org> <4F93E159.7020807@infracaninophile.co.uk> <20120422130642.cb5b09c2.freebsd@edvax.de>

next in thread | previous in thread | raw e-mail | index | archive | help
On 22/04/2012 13:06, Polytropon wrote:

> How about the "extended ASCII character set" that has a mixture
> of "non-US glyphs" and semi-graphic symbols?
>
> 	http://asciiset.com/extended.gif

I can't even write my name in that character set.

As long as there are multiple charactersets you will have the problem of 
some characters being shown wrong. This is nothing particular for UTF-8, 
you have the problem even when choosing between the 10+ different ISO-8859.

The only thing that UTF-8 introduce is the variable byte length 
characters so you can't equate no. bytes with no. characters.

Cheers, Erik

-- 
M: +34 666 334 818
T: +34 915 211 157



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?4F93FFEE.4040905>