extraction-framework

Not reading birth or death date for Theodora_(wife_of_Justinian_I)

Open twistedvisions opened this issue 11 years ago • 4 comments

See her entry here: http://en.wikipedia.org/wiki/Theodora_(wife_of_Justinian_I)

It reads both places:

<http://dbpedia.org/resource/Theodora_(wife_of_Justinian_I)> <http://dbpedia.org/ontology/birthPlace> <http://dbpedia.org/resource/Cyprus> <http://en.wikipedia.org/wiki/Theodora_(wife_of_Justinian_I)?oldid=544575573#absolute-line=19> .
<http://dbpedia.org/resource/Theodora_(wife_of_Justinian_I)> <http://dbpedia.org/ontology/deathPlace> <http://dbpedia.org/resource/Constantinople> <http://en.wikipedia.org/wiki/Theodora_(wife_of_Justinian_I)?oldid=544575573#absolute-line=21> .

But the dates, which are in the info box, are not read.

I would guess the birth date fails because it is an estimate: c. 500

I would also guess that the death date fails because an age is appended: 28 June 548 (aged 48)
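
To check the second guess outside the framework, here is a minimal Python sketch that parses a day-month-year string with an optional "(aged N)" suffix. It is not the framework's DateTimeParser; the pattern `DATE_WITH_AGE` and the function `parse_day_month_year` are hypothetical names made up for this illustration.

```python
import re
from datetime import date

# English month names mapped to their numbers (1..12).
MONTHS = {name: i for i, name in enumerate(
    ["January", "February", "March", "April", "May", "June", "July",
     "August", "September", "October", "November", "December"], start=1)}

# Hypothetical pattern: "28 June 548", optionally followed by "(aged 48)".
DATE_WITH_AGE = re.compile(
    r"^\s*(\d{1,2})\s+([A-Z][a-z]+)\s+(\d{1,4})\s*(?:\(aged\s+\d+\))?\s*$")

def parse_day_month_year(text):
    """Return a date for 'D Month Y' with an optional '(aged N)' suffix, else None."""
    m = DATE_WITH_AGE.match(text)
    if m is None or m.group(2) not in MONTHS:
        return None
    try:
        return date(int(m.group(3)), MONTHS[m.group(2)], int(m.group(1)))
    except ValueError:
        # Out-of-range day or year (for example 31 June, or year 0).
        return None

print(parse_day_month_year("28 June 548 (aged 48)"))
print(parse_day_month_year("c. 500"))
```

With this pattern the age suffix alone does not block parsing, while the estimated value c. 500 is rejected because it has no day or month.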

twistedvisions avatar Jan 30 '14 22:01 twistedvisions

It reads the death date of http://en.wikipedia.org/wiki/Albert,_Duke_in_Prussia even though that also has an age appended, so I'm not sure about that theory anymore (according to: http://mappings.dbpedia.org/server/mappings/en/extractionSamples/Mapping_en:Infobox_royalty)

twistedvisions avatar Jan 30 '14 22:01 twistedvisions

I added a test case with the failed string, but the date was extracted successfully, so I am not sure why it failed. Albert,_Duke_in_Prussia (http://en.wikipedia.org/wiki/Albert,_Duke_in_Prussia) is a different case, as it uses a template to generate the date.

Kontokostas Dimitris

jimkont avatar Jan 31 '14 15:01 jimkont

Weird - am I right that there are no dates for Theodora in the 3.9 data? I can see those dates here: http://live.dbpedia.org/page/Theodora_(wife_of_Justinian_I)

Has it been fixed in the meantime, since the release?

Is there a recent non-incremental dump that I can download to test these things?

wbecker avatar Jan 31 '14 15:01 wbecker

  • http://live.dbpedia.org/page/Albert,_Duke_in_Prussia is ok
  • http://live.dbpedia.org/page/Theodora_(wife_of_Justinian_I) should be fixed with #167 and #168

In any case, the DateTimeParser still cannot handle approximate years matching c\. (\d){3,4}, such as c. 500.
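
For illustration only, the pattern can be tried in a few lines of Python. This is a sketch, not the DateTimeParser code; `CIRCA_YEAR` and `parse_circa_year` are hypothetical names. Note that `(\d){3,4}` as written would capture only the last digit, so the sketch uses `(\d{3,4})` to capture the whole year, and allows optional whitespace after the period.

```python
import re

# Hypothetical pattern: "c." followed by a 3- or 4-digit year, captured as one group.
CIRCA_YEAR = re.compile(r"^\s*c\.\s*(\d{3,4})\s*$")

def parse_circa_year(text):
    """Return the year of an approximate date such as 'c. 500', else None."""
    m = CIRCA_YEAR.match(text)
    return int(m.group(1)) if m else None

print(parse_circa_year("c. 500"))
print(parse_circa_year("28 June 548"))
```

How an approximate year should then be represented in the extracted data is a separate question that this sketch does not address.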

ninniuz avatar Feb 03 '14 19:02 ninniuz