OLMo
OLMo copied to clipboard
Document GPU hours for the ptrained models
📚 The doc issue
The documentation is very nice and complete with FLOP estimates, cluster characteristics, etc. There is just one small detail missing: how long (in days) did it take to train the model ?
Suggest a potential alternative/fix
No response