Skip site navigation (1)Skip section navigation (2)
Date:      Thu, 11 Apr 2002 06:19:29 -0400
From:      "Dan Langille" <dan@langille.org>
To:        Terry Lambert <tlambert2@mindspring.com>
Cc:        chat@freebsd.org
Subject:   Re: what are these characters please?
Message-ID:  <20020411102024.3E6283F30@bast.unixathome.org>
In-Reply-To: <3CB4FBFB.9D2AC7E0@mindspring.com>

next in thread | previous in thread | raw e-mail | index | archive | help
On 10 Apr 2002 at 19:59, Terry Lambert wrote:

> Dan Langille wrote:
> > I found these characters in a recent cvs-all commit:
> > 
> > 20 20 20 20 20 5b 53 75  62 6d 69 74 74 65 64 20  |     [Submitted | 62
> > 79 3a 20 56 69 6c 6c  65 20 53 6b 79 74 74 1b  |by: Ville Skytt.| 2c 41
> > 64 1b 28 42 20 3c  76 69 6c 6c 65 2e 73 6b  |,Ad.(B <ville.sk| 79 74 74
> > 61 40 69 6b 69  2e 66 69 3e 5d 0a 20 20  |ytta@iki.fi>].  |
> > 
> > When viewed under vi, I get:
> > 
> > Ville Skytt^[,Ad^[(B <ville.skytta@iki.fi>
> 
> ANSI character set selector escape sequence for 7 bit representation
> of 8 bit characters.
> 
> If I had to guess, I would say "eth", which is a "D" with a bar in it,
> unlike "thorn", which is an "O" with a forwars slash through it.  8-).
> 
> Obviously a deficiency in the encapsulation of a cut-and-paste
> that was not attributed by encoding, because CVS commit logs are
> not MIME encapsulated.

Given that I'm trying to process the cvs-all messages into XML documents 
(using the perl module XML::Writer which does not do any encoding beyond 
characters such as >, <, etc), any suggestions as to how to deal with such 
characters?  I've been looking through cpan but I suspect I'm using the 
wrong search criteria ("encoding").  Any clues?
-- 
Dan Langille
The FreeBSD Diary - http://freebsddiary.org/ - practical examples


To Unsubscribe: send mail to majordomo@FreeBSD.org
with "unsubscribe freebsd-chat" in the body of the message




Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20020411102024.3E6283F30>