Skip site navigation (1)Skip section navigation (2)
Date:      Tue, 9 Aug 2011 16:40:14 -0400
From:      Alejandro Imass <>
Subject:   Re: extracting text from docx files
Message-ID:  <>
In-Reply-To: <>
References:  <> <> <>

Next in thread | Previous in thread | Raw E-Mail | Index | Archive | Help
On Tue, Aug 9, 2011 at 3:57 PM, Antonio Olivares
<> wrote:
>> But if you really, really need to read docx, you can try the web
>> application from Microsoft. A few months ago, I got also a lot of docx
>> and I opend it with the microsoft web app; this worked for me to extract
>> the information...

just a thought here but if docx is XML why not just find/build some
XSLT that extracts what you need into another format?
you probably have libxml2 and libxslt already in your system, and the
command line utility: xsltproc
there are probably already existing XSLT to transform to RTF and plain text.

Alejandro Imass

Want to link to this message? Use this URL: <>