TinyLlama
TinyLlama copied to clipboard
How do I use these data sets to train new models?
How do I use these data sets to train new models? https://huggingface.co/datasets/Skywork/SkyPile-150B https://huggingface.co/datasets/EleutherAI/proof-pile-2
@jzhang38 Can you provide a script? I'm a little confused on how to modify the script.
Hi we are working on these two datasets, will release the scripts when we finish.