Mailing Lists: Apple Mailing Lists

Image of Mac OS face in stamp
 
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Comparing two images



From: Scott Ellsworth <email@hidden>

A friend just asked how one would scan approximately 40k images for
duplicates..  Anyone able to recommend a Java toolkit for same that
works with reasonable speed on the Mac?

This is a terribly complicated problem to do right, but I wanted to
try an "off the cuff" fast solution first.  Something perhaps a bit
more clever than a straight checksum of the image data, like a color
histogram.  This would let us decide whether something smart that
actually analyzes the images is a good idea.

comparing (checksum, file size) worked pretty well for me. keeping a serialized form of the library's information made incremental addition very fast.


obviously, if you could have the same image data with different metadata, this is no use. even then, i'd probably still start with the same technique on just the image data.

i'd be interested to hear about other techniques that people have actually tried and know to work, though.

--
Elliott Hughes, http://www.jessies.org/~enh/

_______________________________________________
Do not post admin requests to the list. They will be ignored.
Java-dev mailing list      (email@hidden)
Help/Unsubscribe/Update your Subscription:
http://lists.apple.com/mailman/options/java-dev/email@hidden

This email sent to email@hidden


Visit the Apple Store online or at retail locations.
1-800-MY-APPLE

Contact Apple | Terms of Use | Privacy Policy

Copyright © 2007 Apple Inc. All rights reserved.