You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The dataset has some problems. For each source, the entities have been replaced by ids. However only the subject has been replaced. For example in D5 in DBpedia <dbp:eric_r%c3%bccker_eddison> has the id 1692349, but triples like 738353 <dbo:author> <dbp:eric_r%c3%bccker_eddison> exists. What's more problematic even is that in the ground truth the URL encoding is not used and the characters are missing and mixed-case is used. For D5 this is the respective line in the ground truth: <http://data.archiveshub.ac.uk/id/person/othersource/eddisonericrucker1882-1945author> <dbp:Eric_Rcker_Eddison>.
Another problem, is that the ground truth contains entities that do not show up in the dataset. For example for D5 the line <http://data.archiveshub.ac.uk/id/person/ncarules/streetarthurgeorge1892-1966farmerauthorandjournalist> <dbp:A._G._Street> contains the <dbp:A._G._Street> which does not show up anywhere anymore regardless of case.
Presented in this paper, link to datasets
The text was updated successfully, but these errors were encountered: