MSML icon indicating copy to clipboard operation
MSML copied to clipboard

SFT/test_data

Open leeparkuky opened this issue 4 months ago • 1 comments

Please share the SFT/test_data mentioned in the eval folder so that evaluation can be replicated for other choice of input models. In addition, please please share the SFT models so that I have my own sanity check of what's the difference between SFT and DPO models.

leeparkuky avatar Aug 18 '25 01:08 leeparkuky

SFT data is here https://huggingface.co/datasets/morganstanley/sft-python-q-problems-sft. We'll upload the 32B intermediate checkpoints (sft and non-reasoning rl) today and do the same for 7B models later. Thank you for the reminder.

yuriyn-msml avatar Aug 26 '25 17:08 yuriyn-msml