From owner-freebsd-doc Sun Mar 23 18:40:34 2003 Delivered-To: freebsd-doc@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 1A44137B401 for ; Sun, 23 Mar 2003 18:40:33 -0800 (PST) Received: from mailsrv.otenet.gr (mailsrv.otenet.gr [195.170.0.5]) by mx1.FreeBSD.org (Postfix) with ESMTP id 58E2443F75 for ; Sun, 23 Mar 2003 18:40:31 -0800 (PST) (envelope-from keramida@ceid.upatras.gr) Received: from gothmog.gr (patr530-b223.otenet.gr [212.205.244.231]) by mailsrv.otenet.gr (8.12.8/8.12.8) with ESMTP id h2O2eRh1019981; Mon, 24 Mar 2003 04:40:28 +0200 (EET) Received: from gothmog.gr (gothmog [127.0.0.1]) by gothmog.gr (8.12.8/8.12.8) with ESMTP id h2O2eQqu023259; Mon, 24 Mar 2003 04:40:26 +0200 (EET) (envelope-from keramida@ceid.upatras.gr) Received: (from giorgos@localhost) by gothmog.gr (8.12.8/8.12.8/Submit) id h2O2eQ0o023258; Mon, 24 Mar 2003 04:40:26 +0200 (EET) (envelope-from keramida@ceid.upatras.gr) Date: Mon, 24 Mar 2003 04:40:26 +0200 From: Giorgos Keramidas To: Jeroen Ruigrok/asmodai Cc: freebsd-doc@FreeBSD.ORG Subject: Re: docs/50211: [PATCH] Fix textfile creation Message-ID: <20030324024026.GA23139@gothmog.gr> References: <200303231710.h2NHAGEb024196@freefall.freebsd.org> <20030324020745.GA22656@gothmog.gr> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20030324020745.GA22656@gothmog.gr> X-RAVMilter-Version: 8.4.2(snapshot 20021217) (terpsi) Sender: owner-freebsd-doc@FreeBSD.ORG Precedence: bulk List-ID: List-Archive: (Web Archive) List-Help: (List Instructions) List-Subscribe: List-Unsubscribe: X-Loop: FreeBSD.org On 2003-03-24 04:07, Giorgos Keramidas wrote: > giorgos@gothmog[03:57]/tmp$ elinks -dump-charset ISO-8859-7 -dump 1 lala.html > AAeec,ieeue eaass`iaaii. > giorgos@gothmog[03:57]/tmp$ grep charset lala.html > > > Hrmf... Doesn't quite work. At least, it doesn't work without > tweaking the ~/.elinks files and stuff. This is bad, because we can't > use elinks in batch mod conversion of many different languages and > charsets without first configuring it through the curses interface. > > There is an -eval command line option that should probably work fine > with non ISO-8859-1 texts, when used as: > > elinks -eval 'set document.codepage.assume = "ISO-8859-7"' \ > -eval 'set terminal.vt220.charset = "ISO-8859-7"' \ > -dump 1 lala.html > > but I can't seen to find any good way of making this output raw 8-bit > text for Greek :( Hmmm, now that I think about this reply a bit more, it might sound like I'm being negative just for the sake of it. I'm not. I have been using w3m for producing text versions of the few Greek documents I managed to write so far. It just works. No special tweaking of ~/.w3m needed, no strange conversions donen to 8-bit text. The patch that makes our doc/ tree use w3m is very small too. Less than 50 lines in `cvs di -u' format. http://people.freebsd.org/~keramida/diff/w3m_docs.patch If anyone is interested in testing this with a full doc and/or web build on i386, alpha and sparc64 I'd be very glad to commit it and update textproc/docproj to depend on w3m instead of links. - Giorgos To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-doc" in the body of the message