Date: Sun, 11 Oct 2009 01:11:52 -0700 From: Gary Kline <kline@thought.org> To: freebsd-questions@FreeBSD.ORG Subject: Re: for perl wizards. Message-ID: <20091011081151.GA98601@thought.org> In-Reply-To: <200910091026.n99AQPUv014685@lurza.secnetix.de> References: <20091009083516.GA60096@thought.org> <200910091026.n99AQPUv014685@lurza.secnetix.de>
next in thread | previous in thread | raw e-mail | index | archive | help
On Fri, Oct 09, 2009 at 12:26:25PM +0200, Oliver Fromme wrote: > Gary Kline <kline@thought.org> wrote: > > > > Whenever I save a wordpeocessoe file [OOo, say] into a > > text file, I get a slew of hex codes to indicate the char to be > > used. I'm looking for a perl one-liner or script to translate > > hex back into ', ", -- [that's a dash), and so forth. Why does > > this fail to trans the hex code to an apostrophe? > > > > perl -pi.bak -e 's/\xe2\x80\x99/'/g' > > You need to escape the inner quote character, of course. > I think sed is better suited for this task than perl. > > > If there any another other tools, I'm interested! > > That "hex code" rather looks like UTF-8. > > For conversion between character encodings I recommend recode > from the ports collection (ports/converters/recode). > For example, to convert file.txt from UTF-8 to ISO8859-15: > > $ recode utf8..iso8859-15 file.txt > > To preserve the previous file contents, do this: > > $ recode utf8..iso8859-15 <old.txt >new.txt recode works just fine, thanks. i'm not sure if openoffice asks to use 8859-1 or utf8, but yes, the text is interspersperced with bits of utf8. FWIW, I did check this on google before my post; still couldn't find anything that worked. and of course, above, with the " ' " i did try escaping the punctuation. my shell gave me problems regardless. dunno ... so i asked here:-) thanks to everyone, gary > > Best regards > Oliver > > -- > Oliver Fromme, secnetix GmbH & Co. KG, Marktplatz 29, 85567 Grafing b. M. > Handelsregister: Registergericht Muenchen, HRA 74606, Geschäftsfuehrung: > secnetix Verwaltungsgesellsch. mbH, Handelsregister: Registergericht Mün- > chen, HRB 125758, Geschäftsführer: Maik Bachmann, Olaf Erb, Ralf Gebhart > > FreeBSD-Dienstleistungen, -Produkte und mehr: http://www.secnetix.de/bsd > > "Python tricks" is a tough one, cuz the language is so clean. E.g., > C makes an art of confusing pointers with arrays and strings, which > leads to lotsa neat pointer tricks; APL mistakes everything for an > array, leading to neat one-liners; and Perl confuses everything > period, making each line a joyous adventure <wink>. > -- Tim Peters -- Gary Kline kline@thought.org http://www.thought.org Public Service Unix http://jottings.thought.org http://transfinite.thought.org The 7.31a release of Jottings: http://jottings.thought.org/index.php
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20091011081151.GA98601>