rl-training topic

List rl-training repositories

Text-Summarizer-Pytorch

316 stars, 75 forks

PyTorch implementation of the paper "A Deep Reinforced Model for Abstractive Summarization" and of a pointer-generator network

Sim4Rec

43 stars, 1 fork

Simulator for training and evaluation of Recommender Systems

qa_metrics

59 stars, 7 forks, 59 watchers

An easy-to-use Python package for running quick, basic QA evaluations. This package includes standardized QA evaluation metrics and semantic evaluation metrics: Black-box and Open-Source large language model promp...

Gym

463 stars, 31 forks, 463 watchers

Build RL environments for LLM training