open_lm
open_lm copied to clipboard
How to pretrain on DCLM-BASELINE
Thank you for your excellent work. If I want to use DCLM-BASELINE for pretraining and conduct a rigorous comparison with the DCLM-BASELINE 7B model, what hyper-parameters should I use? Could you provide the corresponding script? Thank you.