extraction-framework
extraction-framework copied to clipboard
Not reading birth or death date for Theodora_(wife_of_Justinian_I)
See her entry here: http://en.wikipedia.org/wiki/Theodora_(wife_of_Justinian_I)
It reads both the places:
<http://dbpedia.org/resource/Theodora_(wife_of_Justinian_I)> <http://dbpedia.org/ontology/birthPlace> <http://dbpedia.org/resource/Cyprus> <http://en.wikipedia.org/wiki/Theodora_(wife_of_Justinian_I)?oldid=544575573#absolute-line=19> .
<http://dbpedia.org/resource/Theodora_(wife_of_Justinian_I)> <http://dbpedia.org/ontology/deathPlace> <http://dbpedia.org/resource/Constantinople> <http://en.wikipedia.org/wiki/Theodora_(wife_of_Justinian_I)?oldid=544575573#absolute-line=21> .
But the dates, which are in the info box, are not read.
I would guess the birth date fails because it is an estimate c. 500
I would also guess that the death date fails because the age is suffixed 28 June 548 (aged 48)
It reads the death date of http://en.wikipedia.org/wiki/Albert,_Duke_in_Prussia, although that has an age appened, so I'm not sure about that theory anymore, (according to: http://mappings.dbpedia.org/server/mappings/en/extractionSamples/Mapping_en:Infobox_royalty)
I added a test case with the failed string but the date was extracted successfully so I am not sure why it did. For Albert,_Duke_in_Prussiahttp://en.wikipedia.org/wiki/Albert,_Duke_in_Prussia it is a different case as it is using a template for the date generation
On Fri, Jan 31, 2014 at 12:40 AM, twistedvisions [email protected]:
Reopened #161 https://github.com/dbpedia/extraction-framework/issues/161 .
— Reply to this email directly or view it on GitHubhttps://github.com/dbpedia/extraction-framework/issues/161 .
Kontokostas Dimitris
Weird - am I right, that there is no dates for Theodora in the 3.9 data? I can see those dates here: http://live.dbpedia.org/page/Theodora_(wife_of_Justinian_I)
It may have been fixed in the mean time since the release?
Is there a latest dump that I can collect to test these things, that isn't incremental?
- http://live.dbpedia.org/page/Albert,_Duke_in_Prussia is ok
- http://live.dbpedia.org/page/Theodora_(wife_of_Justinian_I) should be fixed with #167 and #168
The DateTimeParser is still not able to handle c\. (\d){3,4}
anyway