best-of-n topic
best-of-n repositories
llm_optimization
28 stars · 2 forks
A repository for RLHF training and best-of-n (BoN) sampling over LLMs, with support for reward model ensembles.