rl-training topic
rl-training repositories
Text-Summarizer-Pytorch
Stars: 316, Forks: 75
PyTorch implementation of the paper "A Deep Reinforced Model for Abstractive Summarization" and of a pointer-generator network
Sim4Rec
Stars: 43, Forks: 1
Simulator for training and evaluation of Recommender Systems
qa_metrics
Stars: 59, Forks: 7, Watchers: 59
An easy-to-use Python package for running quick, basic QA evaluations. This package includes standardized QA evaluation metrics and semantic evaluation metrics: black-box and open-source large language model promp...
Gym
Stars: 463, Forks: 31, Watchers: 463
Build RL environments for LLM training