off-policy topic

List off-policy repositories

hindsight-experience-replay

377
Stars
76
Forks
Watchers

This is the pytorch implementation of Hindsight Experience Replay (HER) - Experiment on all fetch robotic environments.

exorl

98
Stars
8
Forks
Watchers

ExORL: Exploratory Data for Offline Reinforcement Learning

curl

559
Stars
88
Forks
Watchers

CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning

linorobot

399
Stars
71
Forks
Watchers

Autonomous ground robots (2WD, 4WD, Ackermann Steering, Mecanum Drive)

rad

399
Stars
71
Forks
Watchers

RAD: Reinforcement Learning with Augmented Data

sunrise

117
Stars
28
Forks
Watchers

SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning

off-policy-continuous-control

73
Stars
10
Forks
Watchers

Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)

flashbax

160
Stars
6
Forks
Watchers

⚡ Flashbax: Accelerated Replay Buffers in JAX

causal-rl

27
Stars
3
Forks
Watchers

Causal RL: Reverse-Environment Network Integrated Actor-Critic Algorithm

solving a simple 4*4 Gridworld almost similar to openAI gym FrozenLake using Qlearning Temporal difference method Reinforcement Learning