extraction-framework
extraction-framework copied to clipboard
The software used to extract structured data from Wikipedia
Properties of type date seem to be extracted incorrectly from Persondata templates: http://dbpedia.org/page/Margaret_Hillis http://dbpedia.org/page/Fan_Linlin For more examples use 2nd query from http://sourceforge.net/p/dbpedia/mailman/message/32912547/ select distinct \* { ?s http://dbpedia.org/ontology/birthDate ?o filter(?o...
``` server/src/main/scala/org/dbpedia/extraction/server/DynamicExtractionManager.scala:66: warning: object Actor in package actors is deprecated: Use the akka.actor package instead. For migration from the scala.actors package refer to the Actors Migration Guide. [WARNING] private def...
Hello! Wikipedia template http://en.wikipedia.org/wiki/Template:Infobox_officeholder have atribute "term_start" and is used in plenty of Wiki-articles. But in DBpedia mapping statistics http://mappings.dbpedia.org/server/templatestatistics/en/?template=Infobox_officeholder the property "term_start" doesn't have instances ("property is mapped but...
[also on mlist](http://sourceforge.net/p/dbpedia/mailman/dbpedia-discussion/thread/02e401d01a17%24e40ece90%24ac2c6bb0%24%40alexiev%40ontotext.com/#msg33159943) topical_concepts gives the topical page for each category, eg ``` dbp:Category:Programming_languages skos:subject dbp:Programming_language ``` But there's no such property. I suggest to use `foaf:focus` "The underlying or...
This is a hard or maybe impossible problem to solve. Still I'll ask. On the bg.wikipedia, it seems that a lot of the data about settlements is modularized into lookup...
https://commons.wikimedia.org/w/index.php?title=File:Hristogdanov.jpg&action=edit has description in two languages: ``` | description = {{bg|Христо Г. Данов}} {{en|Hristo G. Danov}} ``` which would map perfectly to ``` dct:description "Христо Г. Данов"@bg, "Hristo G. Danov"@en....
bg.wikipedia uses a composite field "мерки" (размер бюст-талия-ханш, bust-waist-hip size). Eg see https://bg.wikipedia.org/wiki/Шаблон:Манекен_инфо: Мерки 90-61-91 см. (34-24-36 in.) There already are separate fields bustSize, waistSize, hipSize. But I don't think...
Enhancement to the mapping server: - On the template stats page ([eg this](http://mappings.dbpedia.org/server/templatestatistics/en/?template=Infobox_Olympic_event)) - for every field, there is a blue number of occurrences - make that a hyperlink to...
As @jcsahnwaldt discussed in https://github.com/dbpedia/mappings-tracker/issues/40, person data is handled two ways: [Persondata mapping](http://mappings.dbpedia.org/index.php/Mapping_en:Persondata), [PersondataExtractor.scala](https://github.com/dbpedia/extraction-framework/blob/master/core/src/main/scala/org/dbpedia/extraction/mappings/PersondataExtractor.scala) - PersondataExtractor.scala: sample http://data.dws.informatik.uni-mannheim.de/dbpedia/previews/2014_sl_en_sl_persondata_en.ttl.bz2.txt - enabled for English and German only - extracts 3 names, birth/death...
hey guys, only now gotten to take a look at your 2014 release, and want to sincerely congratulate you on such wonderful and well-presented work. Such an important project going...