extraction-framework
extraction-framework copied to clipboard
The software used to extract structured data from Wikipedia
ConditionalMapping checks a sequence of Conditions and the first satisfied one wins. The Conditions themselves are atomic (no connectives). But this is not flexible enough. Consider http://mappings.dbpedia.org/index.php/Mapping_bg:Музикален_изпълнител: - We need...
submitted by Marco Fossati in [1], [2] ## Description Up to now, the first template infobox on a Wikipedia article defines the DBpedia type of this article, while further infobox...
(Suggested by @JensLehmann) For now the framework supports only the following owl axioms: - owl:equivalentClass - owl:disjointWith - owl:equivalentProperty with _owl:disjointWith_ not supported when exporting the ontology. (1) (in page...
If I look at http://live.dbpedia.org/page/Azulene I do not see the extracted InChI, CAS registry number, nor PubChem compound ID. What is the reason here? (Is it possible to run the...
With regards to the dbpedia-owl:populationTotal property, a number of resources have population counts that are too small by a factor of about 1000: http://dbpedia.org/resource/Ciudad_Ayala,_Morelos "7"^^http://www.w3.org/2001/XMLSchema#integer http://dbpedia.org/resource/Garg%C5%BEdai "17"^^http://www.w3.org/2001/XMLSchema#integer http://dbpedia.org/resource/Urban_water_management_in_Bogot%C3%A1,_Colombia "7"^^http://www.w3.org/2001/XMLSchema#integer http://dbpedia.org/resource/Fruitland,_Maryland...
http://mappings.dbpedia.org/server/templatestatistics/bg/?template=Селище_в_България says that 98.37 % of all property occurrences in Wikipedia ( bg ) are mapped. So we should be very happy. Then how come BG settlements have no cordinates?...
See eg http://mappings.dbpedia.org/server/extraction/bg/extract?format=turtle-triples&extractors=custom&title=София (any resource actually): ``` . "91916"^^ . . ``` Methinks that wikiPageUsesTemplate should be in the http://dbpedia.org/ontology namespace like the other two, because the idea "wikiPageUsesTemplate" is...
Numbered raw props are collapsed to one prop: http://fr.wikipedia.org/w/index.php?title=Antioche&action=edit: ``` | division = [[Région méditerranéenne]] | nom de division = [[Régions de Turquie|Région]] | division2 = [[Hatay]] | nom de...
For a television show, I would expect `dbpprop:showName` to be identical to `foaf:name`. However, in the case of '30 Rock', it's getting treated as a numeric '30' instead: ``` SELECT...
[Modèle:Infobox_Territoire](http://fr.wikipedia.org/wiki/Mod%C3%A8le:Infobox_Territoire#Param.C3.A8tres) uses couples of name/language properties to identify local names and their respective languages. Those are direct ISO language codes. @jplu tried to use the language properties in [Mapping_fr:Infobox_Territoire](http://mappings.dbpedia.org/index.php?title=Mapping_fr:Infobox_Territoire&action=edit) to...