Re: Comparing titles for similarity
Re: Comparing titles for similarity
- Subject: Re: Comparing titles for similarity
- From: Mark Edwards <email@hidden>
- Date: Fri, 24 Jun 2005 14:58:41 +1000
Hi Owen,
Google
http://www.google.com/search?
client=safari&rls=en&q=java+levenshtein&ie=UTF-8&oe=UTF-8 turned up
these which might help get you started:
http://www.javalobby.org/java/forums/t15908.html
http://www.javalobby.org/forums/thread.jspa?threadID=16000&tstart=0
Regards,
Mark
On 24 Jun 2005, at 14:33, Owen McKerrow wrote:
Hi All,
Have an application where a user enters title of a publication which
I then need to check the DB for any similar sounding publications and
return the list if any are found.
Now the titles of these publications can be quite long e.g.
Online mentoring and peer support: Using learning technologies to
facilitate entry into a community of practice
or
Teacher Involvement in the Policy Process: The Key to a Future for the
Physical and Health Education Profession
So I can of course check for the exact match, but what if one has a
spelling mistake or uses a different word (an instead of and or
something like that ).
So I was think I really need to grab say the first 30% and compare on
that. Is that a reasonable idea ?
Has anyone dealt with stuff like this before ?
A friend pointed me to the levenshtein function in PHP, do we have
something similar in java/openbase ?
http://php.planetmirror.com/manual/en/function.levenshtein.php
Owen McKerrow
_______________________________________________
Do not post admin requests to the list. They will be ignored.
Webobjects-dev mailing list (email@hidden)
Help/Unsubscribe/Update your Subscription:
This email sent to email@hidden