Hello Sarah, hello all,
some times ago I had the same problem and came accross this blogpost
that gives some advice about how to developp an algorithm that would do
that :
http://www.hackerfactor.com/blog/index.php?/archives/432-Looks-Like-It.html
. The update section on this blogpost has some links to scripts that
implement this algorithm. My project was canceled so I can't tell you if
it works well but according to comments it can be a good solution. But
if you have money to spend on it, http://services.tineye.com/MatchEngine
is definitely a good choice as their algorithm is quite good.
Hope this helps (and maybe that since my original research is quite old
now, there are now packaged tools that do it, I don't know)
--
Sylvain Machefert - Web services librarian
http://geobib.fr/en
Le 16/10/2014 18:38, Kyle Banerjee a écrit :
> Could you say something about the type of dup detection you need? Are we
> talking true duplicates, or possibly the same image in multiple formats,
> cropped, etc? Roughly how many images (thousands, tens of thousands, etc)
> and how big are they? Also, what did you try that did not meet your needs?
> Thanks,
>
> kyle
>
> On Wed, Oct 15, 2014 at 2:56 PM, Shipley, Sarah <[log in to unmask]>
> wrote:
>
>> Hi,
>>
>> I was wondering if anyone had any recommendations for image de-duping
>> software that compares the images rather than checksums. We're using
>> Visual Similarity Duplicate Image Finder, but find it's not as accurate as
>> we'd like. We have a very large number of images to de-dupe in our photo
>> archives and with the current software can't find a balance of comparison
>> that finds all the dups without producing a lot of false positives.
>>
>>
>> [cid:[log in to unmask]]
>>
>> Sarah Shipley, CA
>> Digital Asset Manager
>> Legislative Department - Office of the City Clerk
>> http://www.seattle.gov/leg/clerk/
>> 600 Fourth Avenue, Floor 3
>> PO Box 94728
>> Seattle, WA 98124-4728
>> 206.684.8119
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
|