extraction-framework
merge Persondata mapping and PersondataExtractor.scala
As @jcsahnwaldt discussed in https://github.com/dbpedia/mappings-tracker/issues/40, person data is currently handled in two ways: the Persondata mapping and PersondataExtractor.scala.
- PersondataExtractor.scala (sample: http://data.dws.informatik.uni-mannheim.de/dbpedia/previews/2014_sl_en_sl_persondata_en.ttl.bz2.txt):
  - enabled for English and German only
  - extracts 3 names, birth/death date/place, description
- Persondata mapping:
  - extracts 1 name, alias, birth/death date/place, description
The two need to be merged. If PersondataExtractor doesn't do special processing, I propose to remove it for uniformity (one less extractor is a win, one more mapping is negligible).
I haven't checked the German (de) mapping, Persondatei.
The add-on information the extractor produces is foaf:surname and foaf:givenName.
By breaking on comma?
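If it really is just a comma split, the logic would amount to something like the sketch below. This is Python for illustration only, not the actual Scala code in PersondataExtractor.scala, which I have not verified; the function name `split_person_name` is hypothetical, and splitting on the first comma only is an assumption.

```python
def split_person_name(name):
    """Split a 'Surname, Given names' string on the first comma.

    Returns (surname, given_name). given_name is None when the input
    has no comma or nothing follows it (assumed behaviour, not taken
    from the real extractor).
    """
    surname, sep, given = name.partition(",")
    surname = surname.strip()
    given = given.strip()
    if not sep or not given:
        return (surname or None, None)
    return (surname, given)


# Candidate values for foaf:surname and foaf:givenName
print(split_person_name("Lincoln, Abraham"))
```

Anything this simple could arguably be expressed in the mapping instead, which would support dropping the extractor. Names with several commas or no comma at all are the cases worth checking against the sample dump.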