alignment-handbook
alignment-handbook copied to clipboard
Released model weights for ablations of KTO/IPO/DPO cannot be found
Hi @edbeeching , thanks for the great work in ablating KTO/IPO/DPO algorithms in #104 . I notice that in this referenced blog, it says the best performing model for each algorithm has been uploaded to the collection page. However, I cannot find these models.
Could you kindly provide these model weights? Thank you in advance.