Date: Tue, 24 Aug 1999 19:50:38 -0400 From: Tim Vanderhoek <vanderh@ecf.utoronto.ca> To: Nik Clayton <nik@freebsd.org> Cc: freebsd-doc@freebsd.org Subject: Re: DocBook formatting style? Message-ID: <19990824195038.D8869@ppp18415.on.bellglobal.com> In-Reply-To: <19990824172812.L65430@kilt.nothing-going-on.org>; from Nik Clayton on Tue, Aug 24, 1999 at 05:28:12PM %2B0100 References: <19990822200737.A65807@ppp18344.on.bellglobal.com> <19990823141611.A1770@catkin.nothing-going-on.org> <19990823172005.C42397@ppp18344.on.bellglobal.com> <19990824172812.L65430@kilt.nothing-going-on.org>
next in thread | previous in thread | raw e-mail | index | archive | help
On Tue, Aug 24, 1999 at 05:28:12PM +0100, Nik Clayton wrote: > > If we had a "sentence oriented" diff, it might be persuaded to blank out > the first and third sentences, because although their position in the file > has changed, their content has not. As a hack, you could do perl -pe 's/\<[^>]+\>//g; s/[\n\t ]+/ /g; s/[!?,.]/$&\n/g; ' | perl -pe 's/\s{2,}/ /g' < Englishfile1 > a perl -pe 's/\<[^>]+\>//g; s/[\n\t ]+/ /g; s/[!?,.]/$&\n/g; ' | perl -pe 's/\s{2,}/ /g' < Englishfile2 > b diff [-uc] a b That actually seems to work reasonably well for some changes. For other changes, it can make the diffs more confusing. [Don't mind the perl regexp --- perl and I weren't getting along today...] -- This is my .signature which gets appended to the end of my messages. To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-doc" in the body of the message
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?19990824195038.D8869>