llm-rlhf topic
List
llm-rlhf
repositories
llm_rlhf
24
Stars
1
Forks
Watchers
realize the reinforcement learning training for gpt2 llama bloom and so on llm model
ssbuild
llm
llm-rlhf
lora
reward