preference-data topic
preference-data repositories
notus
Stars: 159
Forks: 14
Notus is a collection of LLMs fine-tuned with SFT, DPO, SFT+DPO, and/or other RLHF techniques, always following a data-first approach.