extraction-framework
extraction-framework copied to clipboard
Some Dbpedia countries returns invalid 'name' attributes
name = font-size:<num>%;
Performing the next query to dbpedia:
PREFIX dbo: <http://dbpedia.org/ontology/>
PREFIX dbr: <http://dbpedia.org/resource/>
PREFIX foaf: <http://xmlns.com/foaf/0.1/>
SELECT ?country ?label ?longName ?name
WHERE {
?country a dbo:Country.
?country dbo:capital ?capital.
?country rdfs:label ?label .
?country dbo:longName ?longName .
?country foaf:name ?name .
FILTER ( regex(?name, "font-size"))
}
you can see that some countries returns: font-size:<num>%; in the name.
name with two single quotation marks
Performing the next query to dbpedia:
PREFIX dbo: <http://dbpedia.org/ontology/>
PREFIX dbr: <http://dbpedia.org/resource/>
PREFIX foaf: <http://xmlns.com/foaf/0.1/>
SELECT ?country ?label ?longName ?name
WHERE {
?country a dbo:Country.
?country dbo:capital ?capital.
?country rdfs:label ?label .
?country dbo:longName ?longName .
?country foaf:name ?name .
FILTER ( regex(?name, "''"))
}
you can see that some countries returns '' in their names and others contains '' at the beggining, like El Salvador.