Re: duplicates in a file
Re: duplicates in a file
- Subject: Re: duplicates in a file
- From: Rick Norman <email@hidden>
- Date: Fri, 19 Jul 2002 10:05:28 -0500
There are three columns, but where I really see the problem is like in the
following example.
(Example 1)
Jane Doe PO Box 123 City, ST 12345
Jane Doe P.O. Box 123 City, ST 12345
(Obviously these are duplicates, but when comparing the address lines, they
will appear to be different)
(Example 2)
Jane Doe PO Box 123 City, ST 12345
Joe and Jane Doe PO Box 123 City, ST 12345
Jane Doe P.O. Box 123 City, ST 12345
(These 3 records are all for the same recipient, but now there is a problem
comparing names and addresses)
If addresses are compared and there is a way to ignore the periods in the
P.O. Box so that P.O. and PO look the same then it could be done with some
accuracy.
Rick Norman
>
From: "Robert C. Jacobson" <email@hidden>
>
Date: Fri, 19 Jul 2002 10:47:22 -0400
>
To: Rick Norman <email@hidden>
>
Subject: Re: duplicates in a file
>
>
At 9:09 AM -0500 7/19/02, Rick Norman wrote:
>
> Anybody got any ideas on removing duplicate entries in a tab delimited file.
>
> I used the following script comparing "C1R1" against "C1R2" and so on. After
>
> looking a little closer, I realize that there needs to be a little more
>
> considered here, see the snip of the data to see what I'm referring to. This
>
> Excel script is excruciatingly slow, I'm hoping that this can be done with
>
> BBEdit.
>
> Thanks
>
> Rick Norman
>
>
>
>
Are you using Mac OS X?
>
Is there only one column?
>
>
If so, this can be done very easily from the Terminal:
>
sort file | uniq > newfile
>
>
If you're using OSX but there's more than one column, that's harder
>
but still possible.
>
>
--
>
-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
>
Robert Jacobson email@hidden
>
BS, Aeronautical Engineering Univ. of Md., College Park
>
Flight Ops. Team - SOlar Heliospheric Observatory (SOHO)
>
(301) 286-1591
_______________________________________________
applescript-users mailing list | email@hidden
Help/Unsubscribe/Archives:
http://www.lists.apple.com/mailman/listinfo/applescript-users
Do not post admin requests to the list. They will be ignored.