fatcat icon indicating copy to clipboard operation
fatcat copied to clipboard

datacite importer duplicated author names

Open bnewbold opened this issue 5 years ago • 0 comments

Sometimes Datacite metadata includes the same people/entities as both "creators" and "contributors", and we end up duplicating them in fatcat metadata. Eg:

  • https://fatcat.wiki/release/spjysmrnsrgyzgq6ise5o44rlu/contribs
  • https://api.datacite.org/dois/10.25940/roper-31098406

I think the behavior should probably be to only add the contributors if they are not already in the author list by string check. Not sure if this should be a fuzzy string check; and exact check is a good start.

Will need to do cleanup as well.

bnewbold avatar Jul 02 '20 18:07 bnewbold