Date: Fri, 9 Oct 2009 12:26:25 +0200 (CEST) From: Oliver Fromme <olli@lurza.secnetix.de> To: freebsd-questions@FreeBSD.ORG, kline@thought.org Subject: Re: for perl wizards. Message-ID: <200910091026.n99AQPUv014685@lurza.secnetix.de> In-Reply-To: <20091009083516.GA60096@thought.org>
next in thread | previous in thread | raw e-mail | index | archive | help
Gary Kline <kline@thought.org> wrote: > > Whenever I save a wordpeocessoe file [OOo, say] into a > text file, I get a slew of hex codes to indicate the char to be > used. I'm looking for a perl one-liner or script to translate > hex back into ', ", -- [that's a dash), and so forth. Why does > this fail to trans the hex code to an apostrophe? > > perl -pi.bak -e 's/\xe2\x80\x99/'/g' You need to escape the inner quote character, of course. I think sed is better suited for this task than perl. > If there any another other tools, I'm interested! That "hex code" rather looks like UTF-8. For conversion between character encodings I recommend recode from the ports collection (ports/converters/recode). For example, to convert file.txt from UTF-8 to ISO8859-15: $ recode utf8..iso8859-15 file.txt To preserve the previous file contents, do this: $ recode utf8..iso8859-15 <old.txt >new.txt Best regards Oliver -- Oliver Fromme, secnetix GmbH & Co. KG, Marktplatz 29, 85567 Grafing b. M. Handelsregister: Registergericht Muenchen, HRA 74606, Geschäftsfuehrung: secnetix Verwaltungsgesellsch. mbH, Handelsregister: Registergericht Mün- chen, HRB 125758, Geschäftsführer: Maik Bachmann, Olaf Erb, Ralf Gebhart FreeBSD-Dienstleistungen, -Produkte und mehr: http://www.secnetix.de/bsd "Python tricks" is a tough one, cuz the language is so clean. E.g., C makes an art of confusing pointers with arrays and strings, which leads to lotsa neat pointer tricks; APL mistakes everything for an array, leading to neat one-liners; and Perl confuses everything period, making each line a joyous adventure <wink>. -- Tim Peters
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?200910091026.n99AQPUv014685>