alignment-handbook icon indicating copy to clipboard operation
alignment-handbook copied to clipboard

Released model weights for ablations of KTO/IPO/DPO cannot be found

Open ChenDRAG opened this issue 1 year ago • 0 comments

Hi @edbeeching , thanks for the great work in ablating KTO/IPO/DPO algorithms in #104 . I notice that in this referenced blog, it says the best performing model for each algorithm has been uploaded to the collection page. However, I cannot find these models.

Could you kindly provide these model weights? Thank you in advance.

ChenDRAG avatar May 17 '24 06:05 ChenDRAG