extraction-framework

The software used to extract structured data from Wikipedia

Results: 150 extraction-framework issues

ConditionalMapping checks a sequence of Conditions and the first satisfied one wins. The Conditions themselves are atomic (no connectives). But this is not flexible enough. Consider http://mappings.dbpedia.org/index.php/Mapping_bg:Музикален_изпълнител: - We need...

GSoC Warmup task
type: data
status: fix-required
status: minidump-test-required
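The first-match-wins behavior described in the ConditionalMapping issue above, and the missing connectives it asks for, can be sketched roughly as follows. This is an illustrative sketch only, not the framework's actual API: the helper names (`is_set`, `equals`, `all_of`, `any_of`, `first_match`), the infobox property names, and the choice of target classes are all assumptions made for the example.

```python
from typing import Callable, Dict, List, Optional, Tuple

# A template is modeled as a plain dict: infobox property name -> raw value.
Template = Dict[str, str]
Condition = Callable[[Template], bool]


def is_set(prop: str) -> Condition:
    """Atomic condition: the property is present and non-empty."""
    return lambda t: bool(t.get(prop, "").strip())


def equals(prop: str, value: str) -> Condition:
    """Atomic condition: the property has exactly this (trimmed) value."""
    return lambda t: t.get(prop, "").strip() == value


def all_of(*conditions: Condition) -> Condition:
    """Hypothetical AND connective over conditions."""
    return lambda t: all(c(t) for c in conditions)


def any_of(*conditions: Condition) -> Condition:
    """Hypothetical OR connective over conditions."""
    return lambda t: any(c(t) for c in conditions)


def first_match(cases: List[Tuple[Condition, str]],
                template: Template) -> Optional[str]:
    """Check the cases in order; the first satisfied condition wins."""
    for condition, target_class in cases:
        if condition(template):
            return target_class
    return None


# Hypothetical cases for a musical-performer infobox (names are made up).
CASES = [
    (all_of(is_set("members"), is_set("founded")), "Band"),
    (any_of(is_set("birth_date"), is_set("instrument")), "MusicalArtist"),
]
```

With atomic conditions only, the first case could test a single property; the `all_of` and `any_of` wrappers show how connectives could be layered on top without changing the first-match evaluation order.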

Submitted by Marco Fossati in [1], [2]
## Description
Up to now, the first template infobox on a Wikipedia article defines the DBpedia type of this article, while further infobox...

GSoC Warmup task
type: data
status: fix-required
status: minidump-test-required

(Suggested by @JensLehmann) For now the framework supports only the following OWL axioms: owl:equivalentClass, owl:disjointWith, and owl:equivalentProperty, with _owl:disjointWith_ not supported when exporting the ontology. (1) (in page...

enhancement
type: data
status: triage-discussion-needed

If I look at http://live.dbpedia.org/page/Azulene I do not see the extracted InChI, CAS registry number, nor PubChem compound ID. What is the reason here? (Is it possible to run the...

type: data
status: triage-discussion-needed

With regard to the dbpedia-owl:populationTotal property, a number of resources have population counts that are too small by a factor of about 1000:
- http://dbpedia.org/resource/Ciudad_Ayala,_Morelos "7"^^http://www.w3.org/2001/XMLSchema#integer
- http://dbpedia.org/resource/Garg%C5%BEdai "17"^^http://www.w3.org/2001/XMLSchema#integer
- http://dbpedia.org/resource/Urban_water_management_in_Bogot%C3%A1,_Colombia "7"^^http://www.w3.org/2001/XMLSchema#integer
- http://dbpedia.org/resource/Fruitland,_Maryland...

type: data
status: fix-required
status: minidump-test-required
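The populationTotal report above does not state a cause. One plausible explanation, offered here purely as an assumption, is that a value written with a thousands separator (such as "7,000") is cut off at the separator. The sketch below contrasts a naive leading-digits parser with one that accepts grouped digits; both function names are hypothetical and neither is taken from the framework.

```python
import re

# One to three digits followed by groups of exactly three digits,
# separated by a comma, a dot, or whitespace; or a plain digit run.
_GROUPED = re.compile(r"\d{1,3}(?:[,.\s]\d{3})+|\d+")


def parse_leading_digits(raw: str) -> int:
    """Naive parser: keeps only the digits before the first non-digit."""
    match = re.match(r"\s*(\d+)", raw)
    if match is None:
        raise ValueError(f"no leading digits in {raw!r}")
    return int(match.group(1))


def parse_grouped_integer(raw: str) -> int:
    """Parser that treats ',', '.' and whitespace as thousands separators."""
    text = raw.strip()
    if _GROUPED.fullmatch(text) is None:
        raise ValueError(f"not a grouped integer: {raw!r}")
    return int(re.sub(r"[,.\s]", "", text))
```

Treating "." as a grouping character is itself an assumption: in some locales "7.000" is a decimal, so a real fix would need to know the language edition's number format.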

http://mappings.dbpedia.org/server/templatestatistics/bg/?template=Селище_в_България says that 98.37% of all property occurrences in Wikipedia (bg) are mapped. So we should be very happy. Then how come BG settlements have no coordinates?...

GSoC Warmup task
type: data
status: triage-discussion-needed

See eg http://mappings.dbpedia.org/server/extraction/bg/extract?format=turtle-triples&extractors=custom&title=София (any resource actually): ``` . "91916"^^ . . ``` Methinks that wikiPageUsesTemplate should be in the http://dbpedia.org/ontology namespace like the other two, because the idea "wikiPageUsesTemplate" is...

GSoC Warmup task
type: data
status: triage-discussion-needed

Numbered raw props are collapsed to one prop: http://fr.wikipedia.org/w/index.php?title=Antioche&action=edit: ``` | division = [[Région méditerranéenne]] | nom de division = [[Régions de Turquie|Région]] | division2 = [[Hatay]] | nom de...

type: data
status: triage-discussion-needed

For a television show, I would expect `dbpprop:showName` to be identical to `foaf:name`. However, in the case of '30 Rock', it's getting treated as a numeric '30' instead: ``` SELECT...

type: data
status: fix-required
status: minidump-test-required
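The '30 Rock' report above suggests a raw value being read as a number because it starts with digits. The report does not say where this happens, so the following is only a sketch of one defensive approach under that assumption: treat a value as numeric only when the whole string is a number. `parse_raw_value` is a hypothetical helper, not a function from the framework.

```python
import re
from typing import Union

# An optionally signed integer or decimal, and nothing else.
_NUMERIC = re.compile(r"[+-]?\d+(?:\.\d+)?")


def parse_raw_value(raw: str) -> Union[int, float, str]:
    """Return a number only if the entire trimmed value is numeric."""
    text = raw.strip()
    if _NUMERIC.fullmatch(text):
        return float(text) if "." in text else int(text)
    # Anything else, including '30 Rock', stays a string.
    return text
```

The key design choice is `fullmatch` rather than a prefix match, so a title that merely begins with digits is kept intact as text.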

[Modèle:Infobox_Territoire](http://fr.wikipedia.org/wiki/Mod%C3%A8le:Infobox_Territoire#Param.C3.A8tres) uses pairs of name/language properties to identify local names and their respective languages. Those are direct ISO language codes. @jplu tried to use the language properties in [Mapping_fr:Infobox_Territoire](http://mappings.dbpedia.org/index.php?title=Mapping_fr:Infobox_Territoire&action=edit) to...

enhancement
type: data
status: fix-required
status: minidump-test-required