llm-rlhf topic

List llm-rlhf repositories

llm_rlhf

Stars: 27
Forks: 2

Implements reinforcement learning training for LLMs such as GPT-2, LLaMA, and BLOOM.