Thomas Coste

Results 1 repositories owned by Thomas Coste

llm_optimization

21
Stars
0
Forks
Watchers

A repo for RLHF training and BoN over LLMs, with support for reward model ensembles.