preference-alignment topic

List preference-alignment repositories

KnowPAT

183
Stars
16
Forks
Watchers

[Paper][ACL 2024 Findings] Knowledgeable Preference Alignment for LLMs in Domain-specific Question Answering

SimPO

662
Stars
41
Forks
Watchers

SimPO: Simple Preference Optimization with a Reference-Free Reward

Dense_Reward_T2I

26
Stars
0
Forks
Watchers

Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24).

beta-DPO

23
Stars
0
Forks
Watchers

[NeurIPS 2024] Official code of $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$