Improve fuzzy matching when calculating statistics

Open stijnh opened this issue 3 years ago • 0 comments

Calculating the statistics requires fuzzy matching of names. Currently, the matching is deliberately conservative, since we want to avoid incorrectly treating two different names as the same. The matching algorithm should be improved, possibly by adding parameters or by asking the user whether two names are equal.

Fuzzy matching appears in three places:

  • Affiliation names (e.g., "University of Amsterdam" == "the University of Amsterdam")
  • Author names (e.g., "John Doe" == "John M. Doe"?)
  • Venue/conference/journal names (e.g., "Journal on Parallel Computing" == "J. Parallel Computing")
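
To make the proposal concrete, here is a rough sketch of what a tunable matcher with an optional "ask the user" fallback could look like. This is not litstudy's current implementation; the names `normalize`, `similarity`, `names_match`, `threshold`, `review_floor`, and `ask` are all hypothetical, and the default values are arbitrary. It uses only `difflib.SequenceMatcher` from the Python standard library.

```python
import re
from difflib import SequenceMatcher


def normalize(name: str) -> str:
    """Lowercase, replace punctuation with spaces, drop a leading 'the'."""
    tokens = re.sub(r"[^\w\s]", " ", name.lower()).split()
    if tokens and tokens[0] == "the":
        tokens = tokens[1:]
    return " ".join(tokens)


def similarity(a: str, b: str) -> float:
    """Similarity in [0, 1] between the normalized forms of two names."""
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def names_match(a, b, threshold=0.9, review_floor=0.6, ask=None):
    """Decide whether two names refer to the same entity.

    threshold:    scores at or above this are accepted automatically.
    review_floor: scores between review_floor and threshold are passed to
                  the `ask` callback (if given) instead of being rejected.
    ask:          hypothetical callback (a, b) -> bool, e.g. a user prompt.
    """
    score = similarity(a, b)
    if score >= threshold:
        return True
    if ask is not None and score >= review_floor:
        return bool(ask(a, b))
    return False


# Identical after normalization, so accepted automatically.
print(names_match("University of Amsterdam", "the University of Amsterdam"))

# Borderline cases are rejected by default, but can be deferred to the user.
print(names_match("John Doe", "John. M. Doe"))
print(names_match("John Doe", "John. M. Doe", ask=lambda a, b: True))
```

With these defaults the affiliation example matches automatically, while the author and venue examples fall into the review band, so they would only be merged if the user (or a lower `threshold`) says so. A pure character-level ratio is probably too crude for abbreviations such as "J." for "Journal"; token-level handling of initials and abbreviations would likely be needed on top of this.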

stijnh, Jan 25 '22 11:01