best-of-n topic

List best-of-n repositories

llm_optimization

28 Stars, 2 Forks

A repository for RLHF training and best-of-n (BoN) sampling over LLMs, with support for reward model ensembles.