llm-rlhf topic

List llm-rlhf repositories

llm_rlhf

24
Stars
1
Forks
Watchers

realize the reinforcement learning training for gpt2 llama bloom and so on llm model