Wednesday, March 21, 2012

Fuzzy Grouping - First Name Similarities; Bill = William, etc...

Hello,

I was wondering how Fuzzy Grouping deals with and handles first name similarities.Is there a way to configure it so that Anthony = Tony, Bill = William, etc…?I created a simple package with several rows containing similar first names and ran the fuzzy grouping on the first name column.I received only one possible duplicate of Will = William which was at 56%.I lowered the threshold down to 1% and still only one match.

Now I understand and appreciate the reasons for this but was wondering if this type of situation was considered and a way of dealing with it is available.

Thanks,
Beac

Just a thought as my former employer was in the business of matchingpreserve the name provided but look at making a substitution for name matching so that everything is standardized. Thus, Will, Willy, Willie, Bill, Billy, Liam, Wm are all converted to the base name of William before matching is attempted. Store what was provided but use the standardized name for matching purposes.

We had a sizeable table of 2500 name substitutions and it worked well for us.
|||Thanks for the feedback Charles.

No comments:

Post a Comment