LargeEA

issues about the 1M datasets

Open OceanTangWei opened this issue 2 years ago • 2 comments

I am very interested in your work, but I found that the number of entities in the released data does not match the numbers reported in the paper. For example, for the EN-FR dataset the paper reports 1,877,793 EN entities, but in the actual dataset I found only 1,275,304. Is something wrong?
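For reference, a count like the one above can be reproduced with a short script. This is only a minimal sketch: it assumes the triples are stored one per line as tab-separated head, relation, tail, which may not match the actual file layout, and the function name `count_entities` and the toy data are hypothetical.

```python
from typing import Iterable, Set


def count_entities(triple_lines: Iterable[str], sep: str = "\t") -> int:
    """Count distinct entities (heads and tails) in head/relation/tail lines."""
    entities: Set[str] = set()
    for line in triple_lines:
        parts = line.rstrip("\n").split(sep)
        if len(parts) != 3:
            continue  # skip blank or malformed lines
        head, _relation, tail = parts
        entities.add(head)
        entities.add(tail)
    return len(entities)


# Hypothetical toy triples, not taken from the real dataset.
sample = [
    "e1\tr1\te2\n",
    "e2\tr2\te3\n",
    "e1\tr1\te3\n",
]
print(count_entities(sample))
```

An open file handle can be passed in place of `sample`. Note that entities appearing only in an alignment file and never in a triple would not be counted this way.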

OceanTangWei • Apr 09 '23 16:04

This is indeed a problem with the data, as I confirmed when I investigated today. I suspect that a merging step caused it (we wanted to use ILLs from both DBpedia and sameas.org). I am trying to recall how the data was built, but it was a long time ago, so I need some time to figure it out. I will get back to you as soon as I find something.

xz-liu • Apr 10 '23 10:04

Hello, has this issue been resolved?

OceanTangWei • May 07 '23 06:05