Skip site navigation (1)Skip section navigation (2)
Date:      Sat, 5 Sep 2009 11:16:03 -0700
From:      Charlie Kester <corky1951@comcast.net>
To:        freebsd-questions@freebsd.org
Subject:   Re: Is there such thing as a 'soft checksum' tool?
Message-ID:  <20090905181603.GA387@comcast.net>
In-Reply-To: <64c038660909050933h25a91edcw56688993f5557ad2@mail.gmail.com>
References:  <64c038660909050933h25a91edcw56688993f5557ad2@mail.gmail.com>

next in thread | previous in thread | raw e-mail | index | archive | help
On Sat 05 Sep 2009 at 09:33:03 PDT Modulok wrote:
>List,
>
>I'm not even sure such a tool exists, but it's worth asking:
>
>I'm looking for a pseudo-checksum tool for use with catalogging
>images. For example, a strict checksum algorithm, like the sha family,
>will produce a dramatically different checksum for two files which
>differ by only a single bit. I'm looking for something where two
>images images, which are similar, get a proportionally similar
>checksum. When I speak of similarities I'm referring to their image
>patterns. i.e two images of differing sizes, which are otherwise
>identical, would produce very similar checksums. So the closer the
>checksums are, the more similar two given images are.
>
>Does anyone know of anything like this?

libpuzzle might be what you're looking for.  There's a tool called ftwin
that uses libpuzzle to find duplicate or only-slightly-modified files.

http://libpuzzle.pureftpd.org/project/libpuzzle
http://jok.is-a-geek.net/ftwin.php

Both of these are in the portstree.  ;-)




Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20090905181603.GA387>