precomputed pile binidx dataset

Open RichardErkhov opened this issue 1 year ago • 2 comments

A precomputed Pile dataset in binidx format, tokenized with 20b_tokenizer: https://huggingface.co/datasets/RichardErkhov/RWKV-LM_pile_binidx_dataset

RichardErkhov, Jun 23 '23 08:06