Errors when running bash run.sh

Open ghost opened this issue 3 years ago • 3 comments

Hi, thanks for your great work. When I run the following test command, it works.

echo "Running a test..."
python -m prepare_clang8_dataset_test

However, when I run the following command, it reports errors.

python -m prepare_clang8_dataset \
  --lang8_dir="${LANG8_DIR}" \
  --tokenize_text='True' \
  --languages='ru,de,en'

[Screenshot of the error output; image not preserved]

I would appreciate any suggestions. Thank you!

ghost, Aug 18 '21 04:08

I would first check that the target files have been downloaded successfully by making sure that they contain the correct number of lines, see: https://github.com/google-research-datasets/clang8#data-format
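The line-count check suggested above could be sketched roughly as follows. This is a minimal illustration, not part of the clang8 scripts: the function names are hypothetical, and the expected counts are deliberately left for the reader to copy from the data-format section of the README.

```python
from pathlib import Path


def count_lines(path):
    """Count the lines in a file without loading it all into memory."""
    n = 0
    with open(path, "rb") as f:
        for _ in f:
            n += 1
    return n


def check_line_counts(expected):
    """Compare actual line counts against expected ones.

    `expected` maps a file path to the line count listed in the README
    (the caller supplies these values; none are hard-coded here).
    Returns {path: (actual, expected)} for every file that does not match;
    `actual` is None when the file is missing.
    """
    mismatches = {}
    for path, want in expected.items():
        got = count_lines(path) if Path(path).is_file() else None
        if got != want:
            mismatches[path] = (got, want)
    return mismatches
```

An empty result means every listed file has the expected number of lines. A file that reports only a handful of lines is a strong hint that the download did not complete.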

ekQ, Aug 20 '21 10:08

> I would first check that the target files have been downloaded successfully by making sure that they contain the correct number of lines, see: https://github.com/google-research-datasets/clang8#data-format

Yes, I got the same error. The cause was that the target files had not been downloaded properly: they were only about 1 KB each instead of their full size. Make sure Git Large File Storage (Git LFS) is installed, as mentioned in the setup steps, and try again.
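Files of about 1 KB are typically Git LFS pointer stubs that were checked out in place of the real data. A rough way to detect them, assuming the standard pointer format whose first line begins with `version https://git-lfs.github.com/spec/v1`, is sketched below. The function names are hypothetical, and the directory to scan is whatever folder holds the downloaded target files.

```python
from pathlib import Path

# First bytes of a Git LFS pointer file, per the Git LFS pointer format.
LFS_POINTER_PREFIX = b"version https://git-lfs.github.com/spec/v1"


def is_lfs_pointer(path):
    """Return True if the file starts with the Git LFS pointer header."""
    with open(path, "rb") as f:
        return f.read(len(LFS_POINTER_PREFIX)) == LFS_POINTER_PREFIX


def find_lfs_pointers(directory):
    """List files under `directory` that are still LFS pointer stubs."""
    return sorted(
        str(p)
        for p in Path(directory).rglob("*")
        if p.is_file() and is_lfs_pointer(p)
    )
```

If `find_lfs_pointers` returns any paths, those files have not been replaced by their real contents yet, which matches the symptom described here.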

DincyDavis, Sep 01 '21 13:09

Run git lfs pull before running ./run.sh.

mrqorib, Nov 22 '21 15:11