cleanrl issues

Add rnd_ppo.py documentation and refactor

4

## Description Closes #127 ## Types of changes - [ ] Bug fix - [ ] New feature - [ ] New algorithm - [x] Documentation ## Checklist: - [x]...

yooceii

DQN on MountainCar

3

## Problem Description PyTorch DQN fails on MountainCar. Try two settings in [the issue](https://github.com/vwxyzjn/cleanrl/issues/156) ## Checklist - [x] I have installed dependencies via `poetry install` (see [CleanRL's installation guideline](https://docs.cleanrl.dev/get-started/installation/)....

qsh-zh

Adding unit tests

## Problem Description Unit tests are a much-requested addition to the library. Among other things, "the key benefit of unit tests is to make sure the logic...

vwxyzjn

help wanted

Adding Double DQN

1

## Problem Description Hi, I would like to add the Double DQN algorithm to CleanRL. Can someone give me the go-ahead?

AshwinSankar17

Jax c51 contrib

2

## Description JAX implementation for C51 Implementation for #221 ## Types of changes - [ ] Bug fix - [ ] New feature - [x] New algorithm - [ ]...

kinalmehta

Adding Hierarchical RL Algorithms

4

Hi, I'm a PhD student doing work in hierarchical reinforcement learning (specifically [Option-critic-based algorithms](https://arxiv.org/abs/1709.04571)), and I've found this repository to be a particularly helpful starting point when trying to prototype...

DavidSlayback

Implement PPO-DNA algorithm for Atari

19

## Description Add an implementation of the PPO-DNA algorithm for Atari Envpool. ### Paper reproduction (attempt) Here are the episodic rewards after 200M environment steps (50M environment interactions before frame skip), compared to...

jseppanen

Add `rnd_ppo.py` documentation and refactor

5

`rnd_ppo.py` is a bit dated, and I recommend refactoring it to match the style of the other PPO implementations, which would include: - [x] change the name from `rnd_ppo.py` to `ppo_rnd.py` - [x] use...

vwxyzjn

Replace cloud utilities w/ `torchx`

## Problem Description PyTorch recently announced a universal job launcher called torchx (https://pytorch.org/torchx/latest/), which supports launching jobs on AWS Batch, Docker, Kubernetes, and more. We should adopt `torchx`, which perfectly...

vwxyzjn

enhancement

Adding TRPO implementation

1

TRPO is a famous and powerful tool in RL. Although it does not have many practical uses these days, it is very helpful for a learner to read a good...

merak0514

enhancement

help wanted
