From: Dale Scott
To: Jordan
Cc: freebsd-questions, Polytropon
Date: Wed, 22 Apr 2020 17:22:56 -0600 (MDT)
Message-ID: <1584212720.148023480.1587597776062.JavaMail.zimbra@shaw.ca>
References: <09e273ff-4d9d-47eb-a6e1-d91f18c8a0ef@www.fastmail.com> <20200423002530.87cf0d91.freebsd@edvax.de>
Subject: Re: PDF Documents Manipulation Software options

Have you looked at the pdftk port? It's in the ports collection and claims to
be a Java equivalent of PDFtk. IIUC, PDFtk is a product from PDFLabs
(https://www.pdflabs.com/tools/pdftk-the-pdf-toolkit/), who also provide a CLI
tool ("PDFtk Server") with source, which you may be able to compile on FreeBSD.

FWIW, my usual desktop is Windows and I use "PDFTK-Builder" to do what you want
"manually" (extract pages from various PDFs and merge them to create a new
PDF). It seems to be a Windows GUI with PDFtk Server underneath. However, I
recall needing to use the CLI PDFtk Server directly once because PDFTK-Builder
didn't expose a command-line switch I needed.

Good luck! Write back and tell us what you ended up doing.

Cheers,
Dale

---
Dale Scott
Engineering and NPI Leader
Web: www.dalescott.net

----- Original Message -----
> From: "Jordan"
> To: "freebsd-questions"
> Cc: "Polytropon"
> Sent: Wednesday, April 22, 2020 4:50:03 PM
> Subject: Re: PDF Documents Manipulation Software options
> Hi Polytropon,
>
> Thank you for the response.
>
> I agree, manually manipulating PDFs with a GUI is not very efficient; however,
> the work I do cannot be automated (at least I haven't found a way).
>
> On a PDF I may have to extract out pages 3, 5 and 14 into a new 3 page PDF
> because I only need the data from pages 3, 5 and 14. The very next PDF file I
> work on I may need to extract out 21 pages, all in a row, from pages 20 to 40.
> The next PDF file I may need the first and last page. There is absolutely no
> common denominator among these three PDFs, and the only thing in common is
> that I need to extract pages.
>
> In other words, I cannot write a script or use CLI to know ahead of time what
> needs to be done, because it takes my eyes to visually see the pages and to
> know what pages I need extracted. I cannot use grep or any text searching
> function on these PDFs because on one PDF I may need certain pages that contain
> words that have absolutely nothing to do with the pages I need on the next PDF
> I work on. So, very unfortunately, this is a very manual job and requires
> human beings (again, from my knowledge of CLI, scripting, etc).
>
> Using CLI would require me to visually look at a PDF file in a GUI, then turn
> around, open a console, type in a command to extract the specific pages I
> need, and then move that newly created PDF file to a different location that
> may be nested 30 folders deep (and won't be the same location as for the next
> PDF I work on). It is easier to visually click, hold CTRL to continue
> highlighting pages and then right click, extract (which creates a new PDF)
> and then drag and drop the new PDF pages into a GUI file manager (such as
> Thunar) to create the new PDF.
>
> I don't see how CLI tools will work with the above example workflows although I
> am all ears for others' opinions.
> I know "professional" PDF software
> such as Adobe Acrobat Pro works but it is only compatible with Windows (and
> maybe GNU/Linux). I found another proprietary PDF Professional software but it
> is only available for Windows, Mac and GNU/Linux. I am on a quest to find
> something for FreeBSD as I believe it to be a superior OS to the
> aforementioned.
>
> --
> Jordan
>
> On Wed, Apr 22, 2020, at 3:25 PM, Polytropon wrote:
>> On Wed, 22 Apr 2020 15:14:59 -0700, Jordan wrote:
>> > I need PDF software that can add pages, remove pages, extract pages
>> > and redact.
>>
>> With "redact", you're referring to PDF annotations?
>>
>>
>>
>> > All of the PDF ports I have found are simply viewers and don't allow
>> > manipulations of the PDF in a single packaged GUI application. I work
>> > with hundreds of PDFs each day so I cannot work within a CLI to
>> > manipulate the pages.
>>
>> I'd say the exact opposite is the case: If you're going to deal with
>> a lot of files, using CLI tools to automate those tasks usually is
>> the more convenient way - instead of doing everything manually in
>> linear time. :-) But this of course depends on many parameters, such
>> as if the kind of manipulation always is "one of a kind" or can be
>> somehow standardized...
>>
>>
>>
>> > I do a lot of extracting of pages into a new PDF by dragging and
>> > dropping pages from PDF software to a GUI file manager such as
>> > Thunar.
>>
>> This can easily be done with the GUI file manager of your choice
>> plus pdftk. Page preview support is present already in tools like
>> Thunar.
>>
>>
>>
>> > Any suggestions that you use or have heard that works with FreeBSD?
>>
>> Tools like LibreOffice (and surely also OpenOffice) can directly
>> open and manipulate PDF files. Gimp also has support for this
>> format. For CLI operations, pdftk and ImageMagick are versatile
>> tools that can be combined with GUI tools as well as used for
>> batch operations.
>>
>> Is this a starting point for you?
>>
>>
>>
>> --
>> Polytropon
>> Magdeburg, Germany
>> Happy FreeBSD user since 4.0
>> Andra moi ennepe, Mousa, ...
>>
> _______________________________________________
> freebsd-questions@freebsd.org mailing list
> https://lists.freebsd.org/mailman/listinfo/freebsd-questions
> To unsubscribe, send any mail to "freebsd-questions-unsubscribe@freebsd.org"
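
For reference, the three extraction cases Jordan describes (pages 3, 5 and 14;
pages 20 to 40; first and last page) each map onto a single pdftk "cat"
invocation once the pages have been picked by eye. Below is a rough Python
sketch that only builds and prints those command lines. The helper name
pdftk_extract_args and all file names are made up for illustration; the sketch
assumes a pdftk binary on the PATH that accepts the usual
"pdftk IN cat PAGES output OUT" form, including ranges like 20-40 and the
"end" keyword for the last page.

```python
import shlex


def pdftk_extract_args(src, pages, dest):
    """Build the argument list for: pdftk SRC cat PAGES... output DEST.

    `pages` holds page numbers or pdftk range strings such as "20-40"
    or "end". Hypothetical helper, not part of pdftk itself.
    """
    if not pages:
        raise ValueError("at least one page or range is required")
    return ["pdftk", src, "cat", *[str(p) for p in pages], "output", dest]


# The three cases from the thread; the file names are placeholders.
jobs = [
    ("first.pdf", [3, 5, 14], "first-extract.pdf"),
    ("second.pdf", ["20-40"], "second-extract.pdf"),
    ("third.pdf", [1, "end"], "third-extract.pdf"),
]

for src, pages, dest in jobs:
    # Print the command instead of running it, so the sketch works
    # even on a machine without pdftk installed.
    print(shlex.join(pdftk_extract_args(src, pages, dest)))
```

To actually run a command, the returned list could be handed to
subprocess.run(args, check=True). Choosing the pages still has to happen in a
viewer, so this only covers the "type in a command" step, not the visual
selection.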