Skip site navigation (1)Skip section navigation (2)
Date:      Thu, 15 Nov 2001 21:40:17 +0900 (JST)
From:      Hiroki Sato <hrs@eos.ocn.ne.jp>
To:        horcicka@FreeBSD.cz
Cc:        freebsd-doc@FreeBSD.org
Subject:   Re: Why TIDY can never work correctly with ISO-8859-2 and others
Message-ID:  <20011115.214017.71143189.hrs@sekine00.ee.noda.sut.ac.jp>
In-Reply-To: <20011115105650.W57038-100000@dual.ms.mff.cuni.cz>
References:  <20011115105650.W57038-100000@dual.ms.mff.cuni.cz>

next in thread | previous in thread | raw e-mail | index | archive | help
Martin Horcicka <horcicka@FreeBSD.cz> wrote
  in <20011115105650.W57038-100000@dual.ms.mff.cuni.cz>:

horcicka> And if you use char-encoding: raw - character entities with values above 255
horcicka> are not printed as entities - this is really bad in 8-bit encodings.

 Yes, Japanese docs also suffer from it.  The input routine of tidy expands
 any entities first, even if -raw flag is specified.

horcicka> In my opinion Tidy cannot be used for encodings it does not natively support
horcicka> (i.e. for Russian and Czech (- still not in main CVS) translations of pages
horcicka> and docs).

 I think so, too.

 As a workaround, we can apply a patch and use the modified
 version of tidy that can suppress to interpret given entities
 as entities themselves, but I do not know if it will be a good solution.

--
| Hiroki Sato  <hrs@FreeBSD.org>
|              <hrs@eos.ocn.ne.jp>

To Unsubscribe: send mail to majordomo@FreeBSD.org
with "unsubscribe freebsd-doc" in the body of the message




Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20011115.214017.71143189.hrs>