Invalid XML chars in long abstracts
Not sure if this is actually an error. When i use an XML representation of the long abstracts i get parse errors when accessing some abstracts because they contain invalid (not allowed by specification) XML chars. F.e. processing the english abstract for http://dbpedia.org/resource/Olive using openrdf results in:
org.openrdf.rio.RDFParseException An invalid XML character (Unicode: 0x1) was found in the element content of the document.
Maybe this is intentional regarding the other serialization formats do not have a problem with this, but it prevents XML processing of the data. Happens with version 3.9.
+1
@Hronom is this still valid?
Oh this happens a long time ago... I'll try check in near time.
trying to close old issues, when you do, feel free to close directly or re-comment
cheers!
ping @Hronom still valid? See https://databus.dbpedia.org/dbpedia/text/long-abstracts/
@Vehnem can we write test for this?