elie
elie
hey @chinthysl do you have the ckpt somewhere? :)
We are using this branch of datatrove: https://github.com/huggingface/datatrove/tree/nouamane/avoid-s3 (cc @NouamaneTazi) and make sure the config is: https://github.com/huggingface/smollm/blob/main/text/pretraining/smollm3/stage1_8T.yaml Updated the info on the smollm repo to add datatrove branch thanks https://github.com/huggingface/smollm/pull/97
Nanotron readme is not up to date, you need to use this branch of datatrove with the current dataloader https://github.com/huggingface/datatrove/tree/nouamane/avoid-s3 I recommend following this readme https://github.com/huggingface/smollm/tree/main/text/pretraining if you want to...