torchtitan
Any plans to support DPO training?
Out of curiosity, what gaps are you seeing with DPO in torchtune (https://github.com/pytorch/torchtune/blob/main/docs/source/recipes/dpo.rst)?
E.g. multi-node support? Anything else?
Context parallelism is one of the major features missing in torchtune.
Is this tracked in an open issue? I'm working on an implementation that might be helpful.
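For anyone following along, here is a minimal sketch of the DPO objective being discussed (preference pairs scored under the trainable policy and a frozen reference model). The tensor names are placeholders for illustration, not torchtitan or torchtune APIs:

```python
import torch
import torch.nn.functional as F

def dpo_loss(
    policy_chosen_logps: torch.Tensor,    # log-prob of chosen completion under the policy, shape [batch]
    policy_rejected_logps: torch.Tensor,  # log-prob of rejected completion under the policy, shape [batch]
    ref_chosen_logps: torch.Tensor,       # same quantities under the frozen reference model
    ref_rejected_logps: torch.Tensor,
    beta: float = 0.1,
) -> torch.Tensor:
    # Policy-vs-reference log-ratios for the chosen and rejected completions.
    chosen_logratios = policy_chosen_logps - ref_chosen_logps
    rejected_logratios = policy_rejected_logps - ref_rejected_logps
    # DPO pushes the chosen log-ratio above the rejected one.
    logits = beta * (chosen_logratios - rejected_logratios)
    return -F.logsigmoid(logits).mean()
```

The question above is essentially how to compute these per-sample log-probs efficiently at long sequence lengths, which is where context parallelism (sharding the sequence dimension across ranks) would come in.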