rl issues

[Feature] ParallelEnv always creates and EnvCreator for compatibility with collectors

1

## Description When setting a ParallelEnv in a remote data collector and not embedding the env maker in an EnvCreator, an error is raised. This PR solves this issue.

vmoens

bug

CLA Signed

[DO NOT CLOSE] Call for contributions

This issue is a list of contributions requests from the community. ## How to use this list If you're willing to contribute to the library, have a look at the...

vmoens

enhancement

Good first issue

[Feature Request] Tutorial for custom env with complex shapes

1

### 🚀 The feature, motivation and pitch I am totally unable to create a `EnvBase` subclass, where the `*_spec` attribute have complex shapes. For example, I have a state with...

svnv-svsv-jm

enhancement

[NOMERG] Add @overload to forward in losses

3

Loss functions accept non-tensordict data thanks to the tensordict.dispatch decorator. However, we do not provide an overloaded forward, which could be useful to let users know about the typical signature...

vmoens

CLA Signed

[NOMERG] Prototyping tensorclass for losses

3

We project on using @tensorclass to represent losses. The advantage of tensorclass for losses instead of tensordict is that it will help us use all the features of tensordict while...

vmoens

CLA Signed

[Feature Request] partial steps in batches envs

1

## Motivation In many scenarios we need to perform a step only on a subset of batched envs. This includes collecting a complete trajectory for many envs when they end...

vmoens

enhancement

[Algorithm] QGNN mixer

1

Value function mixer using a GNN from https://arxiv.org/abs/2205.13005 missing: - [ ] docs - [ ] tests implemented by @acciorocketships

matteobettini

CLA Signed

new algo

[Feature Request] Muzero and MCTS implementations

1

## Motivation It would be great to have an MCTS and Alphazero implementation, including other model-based RL for benchmarking and comparison. ## Solution I can write a loss function of...

Prakyathkantharaju

enhancement

[Feature Request] Refactoring of TensorSpec and documentation

2

## Motivation When creating a custom environment, deciding the correct TensorSpec (TS) to use is not straightforward. Some TS are never used, some look very similar to each other and...

DavideTr8

enhancement

[WIP, CI] Pre-release submitit scripts

8

## Description In this PR, I propose a script to run all our benchmarks before the release. cc @matteobettini @albertbou92 @BY571 @giadefa TODO: - [ ] make sure Wandb logging...

vmoens

CLA Signed

CI

rl
rl copied to clipboard

Metadata

[Feature] ParallelEnv always creates and EnvCreator for compatibility with collectors

[DO NOT CLOSE] Call for contributions

[Feature Request] Tutorial for custom env with complex shapes

[NOMERG] Add @overload to forward in losses

[NOMERG] Prototyping tensorclass for losses

[Feature Request] partial steps in batches envs

[Algorithm] QGNN mixer

[Feature Request] Muzero and MCTS implementations

[Feature Request] Refactoring of TensorSpec and documentation

[WIP, CI] Pre-release submitit scripts

← Metadata

Owner

Metadata

rl rl copied to clipboard

Metadata

← Metadata

Owner

Metadata

rl
rl copied to clipboard