kge icon indicating copy to clipboard operation
kge copied to clipboard

Fail to preprocess Yago3-10 and DBPedia500

Open JothamWong opened this issue 3 years ago • 3 comments

Hello,

Running the download_all.sh script successfully downloads the abovementioned datasets but runs into issue when processing them, with error "UnicodeDecodeError: 'charmap' codec can't decode byte 0x81 in position 1086: character maps to ".

I tried to fix the issue by adding an encoding="utf-8" argument to the read file but it did not fix the problem.

Thank you for assisting.

JothamWong avatar Dec 09 '21 07:12 JothamWong

Hello, i just cloned the newest version of the repository and ran bash download_all.sh in the data directory. Everything worked as expected. Data is downloaded and preprocessed. I am working on Ubuntu 20.

Which operating system are you working on?

AdrianKs avatar Dec 09 '21 09:12 AdrianKs

Windows 10

JothamWong avatar Dec 09 '21 09:12 JothamWong

@psychicmario This is most likely due to a mismatch of encodings in your setup. Please provide the full stack trace to see where the error actually arises.

Note that we generally do not support the Windows platform. Under Windows, consider using WSL instead.

rgemulla avatar Dec 09 '21 10:12 rgemulla