extraction-framework icon indicating copy to clipboard operation
extraction-framework copied to clipboard

The software used to extract structured data from Wikipedia

Results 150 extraction-framework issues
Sort by recently updated
recently updated
newest added

Properties of type date seem to be extracted incorrectly from Persondata templates: http://dbpedia.org/page/Margaret_Hillis http://dbpedia.org/page/Fan_Linlin For more examples use 2nd query from http://sourceforge.net/p/dbpedia/mailman/message/32912547/ select distinct \* { ?s http://dbpedia.org/ontology/birthDate ?o filter(?o...

GSoC Warmup task
type: data
status: fix-provided
status: minidump-test-required
status: triage-discussion-needed

``` server/src/main/scala/org/dbpedia/extraction/server/DynamicExtractionManager.scala:66: warning: object Actor in package actors is deprecated: Use the akka.actor package instead. For migration from the scala.actors package refer to the Actors Migration Guide. [WARNING] private def...

type: software-bug
GSoC Warmup task
status: fix-required
status: test-method-required

Hello! Wikipedia template http://en.wikipedia.org/wiki/Template:Infobox_officeholder have atribute "term_start" and is used in plenty of Wiki-articles. But in DBpedia mapping statistics http://mappings.dbpedia.org/server/templatestatistics/en/?template=Infobox_officeholder the property "term_start" doesn't have instances ("property is mapped but...

question
type: data
status: triage-discussion-needed

[also on mlist](http://sourceforge.net/p/dbpedia/mailman/dbpedia-discussion/thread/02e401d01a17%24e40ece90%24ac2c6bb0%24%40alexiev%40ontotext.com/#msg33159943) topical_concepts gives the topical page for each category, eg ``` dbp:Category:Programming_languages skos:subject dbp:Programming_language ``` But there's no such property. I suggest to use `foaf:focus` "The underlying or...

type: data
status: fix-required
status: minidump-test-required

This is a hard or maybe impossible problem to solve. Still I'll ask. On the bg.wikipedia, it seems that a lot of the data about settlements is modularized into lookup...

type: data
status: fix-required
status: minidump-test-required

https://commons.wikimedia.org/w/index.php?title=File:Hristogdanov.jpg&action=edit has description in two languages: ``` | description = {{bg|Христо Г. Данов}} {{en|Hristo G. Danov}} ``` which would map perfectly to ``` dct:description "Христо Г. Данов"@bg, "Hristo G. Danov"@en....

type: data
status: fix-required
status: minidump-test-required

bg.wikipedia uses a composite field "мерки" (размер бюст-талия-ханш, bust-waist-hip size). Eg see https://bg.wikipedia.org/wiki/Шаблон:Манекен_инфо: Мерки 90-61-91 см. (34-24-36 in.) There already are separate fields bustSize, waistSize, hipSize. But I don't think...

Needs More Examples
type: data
status: fix-required
status: minidump-test-required
status: triage-discussion-needed

Enhancement to the mapping server: - On the template stats page ([eg this](http://mappings.dbpedia.org/server/templatestatistics/en/?template=Infobox_Olympic_event)) - for every field, there is a blue number of occurrences - make that a hyperlink to...

GSoC Warmup task
type: data
status: triage-discussion-needed

As @jcsahnwaldt discussed in https://github.com/dbpedia/mappings-tracker/issues/40, person data is handled two ways: [Persondata mapping](http://mappings.dbpedia.org/index.php/Mapping_en:Persondata), [PersondataExtractor.scala](https://github.com/dbpedia/extraction-framework/blob/master/core/src/main/scala/org/dbpedia/extraction/mappings/PersondataExtractor.scala) - PersondataExtractor.scala: sample http://data.dws.informatik.uni-mannheim.de/dbpedia/previews/2014_sl_en_sl_persondata_en.ttl.bz2.txt - enabled for English and German only - extracts 3 names, birth/death...

type: data
status: triage-discussion-needed

hey guys, only now gotten to take a look at your 2014 release, and want to sincerely congratulate you on such wonderful and well-presented work. Such an important project going...

documentation
type: data
status: triage-discussion-needed