elie

Results 3 comments of elie

hey @chinthysl do you have the ckpt somewhere? :)

We are using this branch of datatrove: https://github.com/huggingface/datatrove/tree/nouamane/avoid-s3 (cc @NouamaneTazi) and make sure the config is: https://github.com/huggingface/smollm/blob/main/text/pretraining/smollm3/stage1_8T.yaml Updated the info on the smollm repo to add datatrove branch thanks https://github.com/huggingface/smollm/pull/97

Nanotron readme is not up to date, you need to use this branch of datatrove with the current dataloader https://github.com/huggingface/datatrove/tree/nouamane/avoid-s3 I recommend following this readme https://github.com/huggingface/smollm/tree/main/text/pretraining if you want to...