Re: Best way of identifying duplicate files in Cocoa
- Subject: Re: Best way of identifying duplicate files in Cocoa
- From: Bill Bumgarner <email@hidden>
- Date: Tue, 20 Nov 2007 15:33:35 -0800
On Nov 20, 2007, at 2:48 PM, Michael Watson wrote:
> I implemented MD5 hashing and comparison in a file diff utility I
> wrote for internal use, and I gotta say . . . it was *fast* with
> tens of thousands of files of varying size. (Say, anywhere from 4KB
> to dozens of megs.)
So did I! Here is the source:
http://svn.red-bean.com/bbum/trunk/hacques/dupinator.py
It checks the file sizes first, then hashes the first 4k of any files
whose sizes match. Finally, it'll hash the full file if both the sizes
and the first-4k hashes match.
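For anyone who wants the general shape without reading the script, here is a
minimal sketch of that three-pass idea. It is not the code from dupinator.py;
the names (CHUNK, file_digest, group_by, find_duplicates) are made up for
illustration, and it assumes MD5 is acceptable for spotting accidental
duplicates rather than for anything security related.

```python
import hashlib
import os
from collections import defaultdict

CHUNK = 4096  # assumed prefix size, matching the "first 4k" described above


def file_digest(path, limit=None):
    """Return the MD5 hex digest of a file, or of its first `limit` bytes."""
    md5 = hashlib.md5()
    remaining = limit
    with open(path, "rb") as f:
        while remaining is None or remaining > 0:
            size = 65536 if remaining is None else min(65536, remaining)
            block = f.read(size)
            if not block:
                break
            md5.update(block)
            if remaining is not None:
                remaining -= len(block)
    return md5.hexdigest()


def group_by(paths, key):
    """Bucket paths by key(path); keep only buckets holding more than one file."""
    buckets = defaultdict(list)
    for p in paths:
        buckets[key(p)].append(p)
    return [group for group in buckets.values() if len(group) > 1]


def find_duplicates(root):
    """Return lists of paths under `root` whose contents hash identically."""
    paths = []
    for dirpath, _dirs, files in os.walk(root):
        for name in files:
            full = os.path.join(dirpath, name)
            # Skip symlinks and anything that is not a regular file.
            if os.path.isfile(full) and not os.path.islink(full):
                paths.append(full)

    duplicates = []
    # Pass 1: the size check is cheap and needs no file reads.
    for same_size in group_by(paths, os.path.getsize):
        # Pass 2: hash only the first CHUNK bytes of the remaining candidates.
        for same_prefix in group_by(same_size, lambda p: file_digest(p, CHUNK)):
            # Pass 3: hash the full file only when size and prefix both agree.
            duplicates.extend(group_by(same_prefix, file_digest))
    return duplicates
```

Calling find_duplicates("/some/dir") gives back a list of groups, each group
being a list of paths with the same full-file hash. The point of the ordering
is that most files drop out on the size check alone, so very few ever get read
in full.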
b.bum