preference-alignment topic
preference-alignment repositories
KnowPAT
183 stars, 16 forks
[Paper][ACL 2024 Findings] Knowledgeable Preference Alignment for LLMs in Domain-specific Question Answering
SimPO
662 stars, 41 forks
SimPO: Simple Preference Optimization with a Reference-Free Reward
Dense_Reward_T2I
26 stars, 0 forks
Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24).
beta-DPO
23 stars, 0 forks
[NeurIPS 2024] Official code of $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$