From owner-freebsd-doc@FreeBSD.ORG Sat Dec 29 12:48:39 2012 Return-Path: Delivered-To: doc@FreeBSD.org Received: from mx1.freebsd.org (mx1.freebsd.org [69.147.83.52]) by hub.freebsd.org (Postfix) with ESMTP id 9DF36A27; Sat, 29 Dec 2012 12:48:39 +0000 (UTC) (envelope-from uqs@FreeBSD.org) Received: from acme.spoerlein.net (acme.spoerlein.net [IPv6:2a01:4f8:131:23c2::1]) by mx1.freebsd.org (Postfix) with ESMTP id 2FA0E8FC13; Sat, 29 Dec 2012 12:48:39 +0000 (UTC) Received: from localhost (acme.spoerlein.net [IPv6:2a01:4f8:131:23c2::1]) by acme.spoerlein.net (8.14.5/8.14.5) with ESMTP id qBTCmZoO010526 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES128-SHA bits=128 verify=NO); Sat, 29 Dec 2012 13:48:37 +0100 (CET) (envelope-from uqs@FreeBSD.org) Date: Sat, 29 Dec 2012 13:48:33 +0100 From: Ulrich =?utf-8?B?U3DDtnJsZWlu?= To: Gabor Kovesdan Subject: Re: Please review, small SGML entity cleanup Message-ID: <20121229124833.GC69724@acme.spoerlein.net> References: <20121228171424.GZ69724@acme.spoerlein.net> <50DECF3E.2020502@FreeBSD.org> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline Content-Transfer-Encoding: 8bit In-Reply-To: <50DECF3E.2020502@FreeBSD.org> User-Agent: Mutt/1.5.21 (2010-09-15) Cc: doc@FreeBSD.org X-BeenThere: freebsd-doc@freebsd.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: Documentation project List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sat, 29 Dec 2012 12:48:39 -0000 On Sat, 2012-12-29 at 12:08:46 +0100, Gabor Kovesdan wrote: > On 2012.12.28. 18:14, Ulrich Spörlein wrote: > > The DE and FR articles are a hodgepodge of SGML entities and direct, > > 8bit chars, with the former being the majority. This patch cleans this > > up a little, although we should eventually switch this all to UTF-8, > > obviously. > Don't they work with direct chars? Once we made a step to that direction They probably will, and I have no clue why we used entities for German and French, but the usual encodings for Russian and Japanese, etc. > so this one would be one step back. If possible, it would be better to > convert the entities to direct chars instead of the opposite. In the end, sure. But that's a larger project of moving from de_DE.ISO8859-1 -> de_DE (with an implied UTF-8 encoding, as is required by XML anyway, the implied part, not the exact encoding). I don't think this commit is a step back, because the documents need to be converted using a long series of s/ü/ü/g, anyway. And the current mish-mash is just weird. Cheers, Uli