RE-Context-or-Names icon indicating copy to clipboard operation
RE-Context-or-Names copied to clipboard

Pre-training data set problem

Open TANGTANG-BOY opened this issue 7 months ago • 0 comments

Hi, I download the dataset from Google cloud. When I run the prepare_data.py in pretrain/code file, there is a bug saying "json.decoder.JSONDecodeError: Expecting ':' delimiter: line 1 column 25283907,At the same time, the Tsinghua Cloud Drive link seems to have expired. Could you provide it again? Thank you.

TANGTANG-BOY avatar May 11 '25 13:05 TANGTANG-BOY