megablocks
SFT Script and Hyperparameters used for DBRX-Instruct
Hi, I saw you mentioned that you used your fork of Megatron-LM for training. Could you please share the scripts and hyperparameters used for the SFT of DBRX? It would mean the world to the OSS community!
At openchat, we'd like to fine-tune your model on our data and open-source the result.