question_generation

No such file or directory: '/root/.cache/huggingface/datasets/squad_multitask/highlight_qg_format/1.0.0/dataset_info.json'

Open tonyken12345 opened this issue 2 years ago • 5 comments

When I run python prepare_data.py, why do I get this message?

tonyken12345 Mar 24 '22 17:03

Me too. This code has many errors.

binggoml Apr 08 '22 08:04

nlp.utils.info_utils.NonMatchingSplitsSizesError: [{'expected': SplitInfo(name='train', num_bytes=226286197, num_examples=253276, dataset_name='squad_multitask'), 'recorded': SplitInfo(name='train', num_bytes=226284739, num_examples=253275, dataset_name='squad_multitask')}]

JingxinLee Apr 12 '22 09:04

nlp.utils.info_utils.NonMatchingSplitsSizesError: [{'expected': SplitInfo(name='train', num_bytes=226286197, num_examples=253276, dataset_name='squad_multitask'), 'recorded': SplitInfo(name='train', num_bytes=226284739, num_examples=253275, dataset_name='squad_multitask')}]

I am facing the same error. Did you solve it?

1836533846 Jul 29 '22 03:07

If you check the file data/squad_multitask/dataset_infos.json, you will find: "splits": {"train": {"name": "train", "num_bytes": 226286197, "num_examples": 253276,......

Change "226286197" to "226284739" and "253276" to "253275" so that the expected values in the file match the recorded ones from the error message. That fixes the error.
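
If you would rather not edit the file by hand, here is a rough sketch of the same change in Python. It assumes dataset_infos.json is a JSON object whose top-level values are config entries that each hold a "splits" -> "train" dict; the thread only shows the "splits" fragment, so the outer layout is a guess. The helper names patch_train_split and patch_file are made up for this example. The numbers come straight from the NonMatchingSplitsSizesError above.

```python
import json
from pathlib import Path

# Sizes copied from the NonMatchingSplitsSizesError message in this thread.
EXPECTED = {"num_bytes": 226286197, "num_examples": 253276}
RECORDED = {"num_bytes": 226284739, "num_examples": 253275}


def patch_train_split(infos: dict) -> int:
    """Overwrite the expected train-split sizes with the recorded ones.

    Assumption: every top-level value is a config dict that may contain
    "splits" -> "train". Returns how many configs were changed.
    """
    changed = 0
    for config in infos.values():
        if not isinstance(config, dict):
            continue
        splits = config.get("splits")
        if not isinstance(splits, dict):
            continue
        train = splits.get("train")
        if not isinstance(train, dict):
            continue
        # Only touch entries that still carry the old expected values.
        if all(train.get(key) == value for key, value in EXPECTED.items()):
            train.update(RECORDED)
            changed += 1
    return changed


def patch_file(path: str = "data/squad_multitask/dataset_infos.json") -> int:
    """Load the JSON file, patch it in memory, and write it back if changed."""
    file_path = Path(path)
    infos = json.loads(file_path.read_text(encoding="utf-8"))
    changed = patch_train_split(infos)
    if changed:
        file_path.write_text(json.dumps(infos), encoding="utf-8")
    return changed
```

Call patch_file() from the repository root, then run python prepare_data.py again. Running it a second time changes nothing, because the old values are no longer present.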

ZihaoLin0123 Aug 11 '22 18:08

I am facing the same error. Have you solved it yet?

William9Baker Jan 06 '23 16:01