• Open Menu Close Menu
  • Apple
  • Shopping Bag
  • Apple
  • Mac
  • iPad
  • iPhone
  • Watch
  • TV
  • Music
  • Support
  • Search apple.com
  • Shopping Bag

Lists

Open Menu Close Menu
  • Terms and Conditions
  • Lists hosted on this site
  • Email the Postmaster
  • Tips for posting to public mailing lists
Re: Best way of identifying duplicate files in Cocoa
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Best way of identifying duplicate files in Cocoa


  • Subject: Re: Best way of identifying duplicate files in Cocoa
  • From: Bill Bumgarner <email@hidden>
  • Date: Tue, 20 Nov 2007 15:33:35 -0800

On Nov 20, 2007, at 2:48 PM, Michael Watson wrote:
I implemented MD5 hashing and comparison in a file diff utility I wrote for internal use, and I gotta say . . . it was *fast* with tens of thousands of files of varying size. (Say, anywhere from 4KB to dozens of megs.)

So did I! Here is source:

http://svn.red-bean.com/bbum/trunk/hacques/dupinator.py

It checks the file sizes and then hashes the first 4k. Finally, it'll hash the full file if the sizes and first 4k matches.

b.bum

_______________________________________________

Cocoa-dev mailing list (email@hidden)

Please do not post admin requests or moderator comments to the list.
Contact the moderators at cocoa-dev-admins(at)lists.apple.com

Help/Unsubscribe/Update your Subscription:
This email sent to email@hidden


  • Follow-Ups:
    • Re: Best way of identifying duplicate files in Cocoa
      • From: Jean-Daniel Dupas <email@hidden>
References: 
 >Best way of identifying duplicate files in Cocoa (From: Frank Reiff <email@hidden>)
 >Re: Best way of identifying duplicate files in Cocoa (From: Jean-Daniel Dupas <email@hidden>)
 >Re: Best way of identifying duplicate files in Cocoa (From: Frank Reiff <email@hidden>)
 >Re: Best way of identifying duplicate files in Cocoa (From: Michael Watson <email@hidden>)

  • Prev by Date: Re: "Cue Sheet Parser" - Use NSString?
  • Next by Date: Re: "Cue Sheet Parser" - Use NSString?
  • Previous by thread: Re: Best way of identifying duplicate files in Cocoa
  • Next by thread: Re: Best way of identifying duplicate files in Cocoa
  • Index(es):
    • Date
    • Thread