Haitam Bouanane

Results 3 comments of Haitam Bouanane

To load just one shard without errors, you should use data_files directly with split set to "train", but don’t specify "allenai/c4", since that points to the full dataset with all...

My apologies, I’ve modified my previous answer. You just need to specify the full path, for example: https://huggingface.co/datasets/allenai/c4/resolve/main/en/c4-train.00000-of-01024.json.gz I hope this updated answer is helpful.