extraction-framework

merge Persondata mapping and PersondataExtractor.scala

Open VladimirAlexiev opened this issue 10 years ago • 2 comments

As @jcsahnwaldt discussed in https://github.com/dbpedia/mappings-tracker/issues/40, person data is currently handled in two ways: by the Persondata mapping and by PersondataExtractor.scala.

  • PersondataExtractor.scala: sample http://data.dws.informatik.uni-mannheim.de/dbpedia/previews/2014_sl_en_sl_persondata_en.ttl.bz2.txt
    • enabled for English and German only
    • extracts 3 names, birth/death date/place, description
  • Persondata map:
    • extracts 1 name, alias, birth/death date/place, description

The two need to be merged. If PersondataExtractor doesn't do special processing, I propose to remove it for uniformity (one less extractor is a win, one more mapping is negligible).

I haven't checked the de mapping Persondatei.

VladimirAlexiev avatar Feb 23 '15 15:02 VladimirAlexiev

The add-on information the extractor produces is foaf:surname and foaf:givenName.

jimkont avatar Feb 25 '15 08:02 jimkont

By breaking the name on the comma?

VladimirAlexiev avatar Feb 25 '15 10:02 VladimirAlexiev
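
For illustration, the comma-splitting idea asked about above could look roughly like the sketch below. It assumes the Persondata name field has the form "Surname, Given names"; the function name `split_persondata_name` is hypothetical, and this is not taken from PersondataExtractor.scala, whose actual logic may differ.

```python
from typing import Optional, Tuple


def split_persondata_name(name: str) -> Optional[Tuple[str, str]]:
    """Hypothetical helper: split 'Surname, Given names' on the first comma.

    Returns (surname, given_name), or None when the value has no comma
    or one side is empty, so the caller can skip emitting
    foaf:surname / foaf:givenName for that value.
    """
    # Assumption: the first comma separates the surname from the given names.
    surname, sep, given = name.partition(",")
    surname, given = surname.strip(), given.strip()
    if not sep or not surname or not given:
        return None
    return surname, given


if __name__ == "__main__":
    for raw in ["Lincoln, Abraham", "Madonna", "King, Martin Luther, Jr."]:
        print(repr(raw), "->", split_persondata_name(raw))
```

Splitting only on the first comma keeps suffixes such as "Jr." with the given name; whether that is the desired behaviour would have to be checked against the real extractor.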