sparseml
sparseml copied to clipboard
Updates to enable ultrachat200k
Ultrachat200k has 2 splits for training, one for sft and another for dpo. As a result it doesn't have a "train" split per se. This PR allows for a train_sft alternative.