weak-to-strong topic
List
weak-to-strong
repositories
aligner
104
Stars
5
Forks
Watchers
Achieving Efficient Alignment through Learned Correction
Aligner2024
aligner
alignment
llm
rlhf