Comparing titles for similarity
Comparing titles for similarity
- Subject: Comparing titles for similarity
- From: Owen McKerrow <email@hidden>
- Date: Fri, 24 Jun 2005 14:33:38 +1000
Hi All,
Have an application where a user enters title of a publication which I
then need to check the DB for any similar sounding publications and
return the list if any are found.
Now the titles of these publications can be quite long e.g.
Online mentoring and peer support: Using learning technologies to
facilitate entry into a community of practice
or
Teacher Involvement in the Policy Process: The Key to a Future for the
Physical and Health Education Profession
So I can of course check for the exact match, but what if one has a
spelling mistake or uses a different word (an instead of and or
something like that ).
So I was think I really need to grab say the first 30% and compare on
that. Is that a reasonable idea ?
Has anyone dealt with stuff like this before ?
A friend pointed me to the levenshtein function in PHP, do we have
something similar in java/openbase ?
http://php.planetmirror.com/manual/en/function.levenshtein.php
Owen McKerrow
WebMaster, emlab
http://emlab.uow.edu.au
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
- - - - - - - -
"As of tomorrow, employees will only be able to access the building
using individual security cards. Pictures will be taken next Wednesday
employees will receive their cards in two weeks."
- "Dilbert Quotes" Winner, Fred Dales, Microsoft Corp
_______________________________________________
Do not post admin requests to the list. They will be ignored.
Webobjects-dev mailing list (email@hidden)
Help/Unsubscribe/Update your Subscription:
This email sent to email@hidden