From: Matthias Apitz
To: freebsd-questions@freebsd.org
Date: Sat, 21 Apr 2012 15:13:13 +0200
Subject: Re: converting UTF-8 to HTML

On Saturday, April 21, 2012 at 11:06:42AM +0200, Erik Nørgaard wrote:

> On 21/04/2012 08:29, Erik Nørgaard wrote:
> > Browsers understand UTF-8 perfectly, simply add
> > to the html header.
>
> Obviously I can't know what your project is, but you'll save yourself
> heaps of problems sticking to UTF-8, in particular if you plan on
> implementing any search functionality or have users submit content.
> Enforce and stick to UTF-8.

Well, it is not a 'project'. I'm writing a diary of what's going on in
my life, and I'm still doing it in an ISO 8859-1 environment, but in HTML
so that I can include pictures etc. ISO 8859-1 is still fine for this
because I write it in Spanish, and ISO 8859-1 has enough characters,
even the accented ones like áíóñ... But sometimes I need to include a
phrase in another language, Russian or Greek or whatever (see the other
mail), and then it is nice to translate that phrase into HTML character
references in plain ASCII. That's all.

> When characters show up wrong in the users browser it's usually because
> the browser is set to use a non-UTF-8 charset by default such as
> windows-1252, the web server sends the charset=ascii in the http header
> and there is no or incorrect meta tag to resolve the problem. Non UTF-8
> charsets are a leftover from last millennium that we sometimes still choke
> on .. sorry the rant ;)

We are all leftovers from the last millennium here. :-)

	matthias
-- 
Matthias Apitz
t +49-89-61308 351 - f +49-89-61308 399 - m +49-170-4527211
e - w http://www.unixarea.de/
UNIX since V7 on PDP-11 | UNIX on mainframe since ESER 1055 (IBM /370)
UNIX on x86 since SVR4.2 UnixWare 2.1.2 | FreeBSD since 2.2.5
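
For readers who want to try the conversion discussed in this message (keeping ISO 8859-1 text as it is and turning anything else, such as a Russian or Greek phrase, into ASCII HTML character references), here is a minimal sketch in Python. It is not the method used in the thread; the function names to_latin1_html and to_ascii_html are made up for illustration. It relies only on the standard "xmlcharrefreplace" codec error handler, and it does not escape &, < or >, so it assumes the input is text that is otherwise safe to paste into an HTML page.

```python
def to_latin1_html(text: str) -> str:
    """Keep ISO 8859-1 characters, replace everything else by &#NNNN;.

    Hypothetical helper for illustration only.
    """
    # Characters the latin-1 codec cannot encode are emitted as decimal
    # numeric character references by the xmlcharrefreplace handler.
    return text.encode("latin-1", errors="xmlcharrefreplace").decode("latin-1")


def to_ascii_html(text: str) -> str:
    """Replace every non-ASCII character by &#NNNN; (pure 7-bit output).

    Hypothetical helper for illustration only.
    """
    return text.encode("ascii", errors="xmlcharrefreplace").decode("ascii")


if __name__ == "__main__":
    sample = "año Ω"
    print(to_latin1_html(sample))  # ñ stays, Ω becomes a reference
    print(to_ascii_html(sample))   # both ñ and Ω become references
```

The first variant fits a page that is served as ISO 8859-1; the second gives output that survives any charset setting, at the price of less readable source text.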