From: "Reko Turja"
To: "Gary Kline", "FreeBSD Mailing List"
Date: Wed, 28 Jan 2009 12:08:55 +0200
Subject: Re: OCR...

> so what is the best commercial/shareware that can read a 10pt-font
> file? (( also, when i have time to get back into actually hacking,
> this [[turning imaged pdf into OCR'able ascii or 8859-1]] is going to
> be a first target. any idea which team i should go with. gOCR looks
> best so far to me.

ABBYY FineReader. OmniPage hasn't been able to catch up with it in
several years, either feature-wise or quality-wise. No idea whether
FineReader runs under an emulator, though. If the file is already a
72 DPI PDF with the text stored as graphics, most of the damage has
already been done, and it will be extremely hard to OCR.

-Reko