From owner-freebsd-doc@FreeBSD.ORG  Sat Dec 29 12:48:39 2012
Return-Path: <owner-freebsd-doc@FreeBSD.ORG>
Delivered-To: doc@FreeBSD.org
Received: from mx1.freebsd.org (mx1.freebsd.org [69.147.83.52])
 by hub.freebsd.org (Postfix) with ESMTP id 9DF36A27;
 Sat, 29 Dec 2012 12:48:39 +0000 (UTC) (envelope-from uqs@FreeBSD.org)
Received: from acme.spoerlein.net (acme.spoerlein.net
 [IPv6:2a01:4f8:131:23c2::1])
 by mx1.freebsd.org (Postfix) with ESMTP id 2FA0E8FC13;
 Sat, 29 Dec 2012 12:48:39 +0000 (UTC)
Received: from localhost (acme.spoerlein.net [IPv6:2a01:4f8:131:23c2::1])
 by acme.spoerlein.net (8.14.5/8.14.5) with ESMTP id qBTCmZoO010526
 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES128-SHA bits=128 verify=NO);
 Sat, 29 Dec 2012 13:48:37 +0100 (CET) (envelope-from uqs@FreeBSD.org)
Date: Sat, 29 Dec 2012 13:48:33 +0100
From: Ulrich =?utf-8?B?U3DDtnJsZWlu?= <uqs@FreeBSD.org>
To: Gabor Kovesdan <gabor@FreeBSD.org>
Subject: Re: Please review, small SGML entity cleanup
Message-ID: <20121229124833.GC69724@acme.spoerlein.net>
References: <20121228171424.GZ69724@acme.spoerlein.net>
 <50DECF3E.2020502@FreeBSD.org>
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Disposition: inline
Content-Transfer-Encoding: 8bit
In-Reply-To: <50DECF3E.2020502@FreeBSD.org>
User-Agent: Mutt/1.5.21 (2010-09-15)
Cc: doc@FreeBSD.org
X-BeenThere: freebsd-doc@freebsd.org
X-Mailman-Version: 2.1.14
Precedence: list
List-Id: Documentation project <freebsd-doc.freebsd.org>
List-Unsubscribe: <http://lists.freebsd.org/mailman/options/freebsd-doc>,
 <mailto:freebsd-doc-request@freebsd.org?subject=unsubscribe>
List-Archive: <http://lists.freebsd.org/pipermail/freebsd-doc>
List-Post: <mailto:freebsd-doc@freebsd.org>
List-Help: <mailto:freebsd-doc-request@freebsd.org?subject=help>
List-Subscribe: <http://lists.freebsd.org/mailman/listinfo/freebsd-doc>,
 <mailto:freebsd-doc-request@freebsd.org?subject=subscribe>
X-List-Received-Date: Sat, 29 Dec 2012 12:48:39 -0000

On Sat, 2012-12-29 at 12:08:46 +0100, Gabor Kovesdan wrote:
> On 2012.12.28. 18:14, Ulrich Spörlein wrote:
> > The DE and FR articles are a hodgepodge of SGML entities and direct,
> > 8bit chars, with the former being the majority. This patch cleans this
> > up a little, although we should eventually switch this all to UTF-8,
> > obviously.
> Don't they work with direct chars? Once we made a step to that direction 

They probably will, and I have no clue why we used entities for German
and French, but the usual encodings for Russian and Japanese, etc.

> so this one would be one step back. If possible, it would be better to 
> convert the entities to direct chars instead of the opposite.

In the end, sure. But that's a larger project of moving from
de_DE.ISO8859-1 -> de_DE (with an implied UTF-8 encoding, as is required
by XML anyway, the implied part, not the exact encoding).

I don't think this commit is a step back, because the documents need to
be converted using a long series of s/&uuml;/ü/g, anyway. And the
current mish-mash is just weird.

Cheers,
Uli