Thomas Coste
Results: 1 repository owned by Thomas Coste
llm_optimization
21 stars, 0 forks (watcher count not shown)
A repo for RLHF training and BoN over LLMs, with support for reward model ensembles.