NanoCode012

Results 342 comments of NanoCode012

Thanks, let us know how it goes

uhm, we don't really differentiate tokenized datasets in our sampler/collator. I suppose you need to first: 1) Adjust the code to not pre-tokenize , use `stream=True` . You'll need to...