NanoCode012
Results: 342 comments of NanoCode012
Thanks, let us know how it goes.
uhm, we don't really differentiate tokenized datasets in our sampler/collator. I suppose you need to first: 1) Adjust the code to not pre-tokenize, use `stream=True`. You'll need to...