open_llama
Costs and future
First of all, I would like to express my gratitude for making this available under such a permissive license. It opens new doors for both researchers and industry.
- Could you provide a breakdown of the cost of training such a model? For planning purposes, it is important to know how to budget for training large models like this one for specific use cases, and I am sure many of us would be glad to have some hard data to show our managers.
- Do you plan to train on more tokens than the original Llama paper did, or is the goal only to reproduce its results?
Thanks again for your terrific work!