Skip site navigation (1)Skip section navigation (2)
Date:      Fri, 9 Oct 2009 12:26:25 +0200 (CEST)
From:      Oliver Fromme <olli@lurza.secnetix.de>
To:        freebsd-questions@FreeBSD.ORG, kline@thought.org
Subject:   Re: for perl wizards.
Message-ID:  <200910091026.n99AQPUv014685@lurza.secnetix.de>
In-Reply-To: <20091009083516.GA60096@thought.org>

next in thread | previous in thread | raw e-mail | index | archive | help
Gary Kline <kline@thought.org> wrote:
 > 
 >         Whenever I save a wordpeocessoe file [OOo, say] into a
 >         text file, I get a slew of hex codes to indicate the char to be
 >         used.  I'm looking for a perl one-liner or script to translate
 >         hex back into ', ", -- [that's a dash), and so forth.  Why does
 >         this fail to trans the hex code to an apostrophe?
 > 
 >         perl -pi.bak -e 's/\xe2\x80\x99/'/g'  

You need to escape the inner quote character, of course.
I think sed is better suited for this task than perl.

 >         If there any another other tools, I'm interested!

That "hex code" rather looks like UTF-8.

For conversion between character encodings I recommend recode
from the ports collection (ports/converters/recode).
For example, to convert file.txt from UTF-8 to ISO8859-15:

$ recode utf8..iso8859-15 file.txt

To preserve the previous file contents, do this:

$ recode utf8..iso8859-15 <old.txt >new.txt

Best regards
   Oliver

-- 
Oliver Fromme, secnetix GmbH & Co. KG, Marktplatz 29, 85567 Grafing b. M.
Handelsregister: Registergericht Muenchen, HRA 74606,  Geschäftsfuehrung:
secnetix Verwaltungsgesellsch. mbH, Handelsregister: Registergericht Mün-
chen, HRB 125758,  Geschäftsführer: Maik Bachmann, Olaf Erb, Ralf Gebhart

FreeBSD-Dienstleistungen, -Produkte und mehr:  http://www.secnetix.de/bsd

"Python tricks" is a tough one, cuz the language is so clean. E.g.,
C makes an art of confusing pointers with arrays and strings, which
leads to lotsa neat pointer tricks; APL mistakes everything for an
array, leading to neat one-liners; and Perl confuses everything
period, making each line a joyous adventure <wink>.
        -- Tim Peters



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?200910091026.n99AQPUv014685>