# rl
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
## Describe the bug

When you get a tensordict rollout of shape `(N_envs, N_steps, C, H, W)` out of a collector and you want to apply an advantage module that...
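A minimal sketch of that setup, assuming torchrl's `GAE` estimator and a `(N_envs, N_steps)` batched rollout; a flat observation of size 8 stands in for the `(C, H, W)` frames, and all shapes and names here are illustrative, not taken from the issue.

```python
import torch
from tensordict import TensorDict
from tensordict.nn import TensorDictModule
from torchrl.objectives.value import GAE

n_envs, n_steps, obs_dim = 4, 16, 8

# Value network reading "observation" and writing "state_value", as GAE expects.
value_net = TensorDictModule(
    torch.nn.Linear(obs_dim, 1), in_keys=["observation"], out_keys=["state_value"]
)
advantage = GAE(gamma=0.99, lmbda=0.95, value_network=value_net)

# Collector-style rollout batched as (N_envs, N_steps), time in the last dim.
rollout = TensorDict(
    {
        "observation": torch.randn(n_envs, n_steps, obs_dim),
        "next": TensorDict(
            {
                "observation": torch.randn(n_envs, n_steps, obs_dim),
                "reward": torch.randn(n_envs, n_steps, 1),
                "done": torch.zeros(n_envs, n_steps, 1, dtype=torch.bool),
                "terminated": torch.zeros(n_envs, n_steps, 1, dtype=torch.bool),
            },
            batch_size=[n_envs, n_steps],
        ),
    },
    batch_size=[n_envs, n_steps],
)
advantage(rollout)  # should write "advantage" and "value_target" in place
```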
## Description

Adds a tensordict/torchrl version of the PyTorch example reinforcement_q_learning.py.

## Motivation and Context

Adds a simpler DQN example.

- [ ] I have raised an...
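For orientation, a very small sketch of what such a DQN setup looks like in torchrl; the CartPole task and network sizes are arbitrary choices, and this is not the example the PR adds.

```python
import torch.nn as nn
from torchrl.envs.libs.gym import GymEnv
from torchrl.modules import QValueActor
from torchrl.objectives import DQNLoss

env = GymEnv("CartPole-v1")
obs_dim = env.observation_spec["observation"].shape[-1]
n_act = env.action_spec.shape[-1]

# Q-network mapping observations to one value per action.
value_net = nn.Sequential(nn.Linear(obs_dim, 64), nn.Tanh(), nn.Linear(64, n_act))
actor = QValueActor(value_net, in_keys=["observation"], spec=env.action_spec)
loss_fn = DQNLoss(actor, action_space=env.action_spec)

rollout = env.rollout(10, actor)  # short rollout with the greedy actor
loss_vals = loss_fn(rollout)      # tensordict holding the DQN loss terms
```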
## Description

Allows resetting the parameters in the loss module.
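A hedged sketch of what such a reset could look like; the helper below is hypothetical and simply relies on the standard `reset_parameters` convention of `torch.nn` modules.

```python
import torch.nn as nn

def reset_loss_parameters(loss_module: nn.Module) -> None:
    """Re-initialize every submodule that defines `reset_parameters`."""
    for module in loss_module.modules():
        if hasattr(module, "reset_parameters"):
            module.reset_parameters()
```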
## Description

Follow-up from this [pull request](https://github.com/pytorch/rl/pull/1892), copy-pasted: we plan on using https://github.com/Tensorclass to represent losses. The advantage of tensorclass over tensordict for losses is that it will...
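A sketch of what a tensorclass-based loss container could look like, assuming the `tensorclass` decorator from the tensordict package; the field names are illustrative, not the ones the project will settle on.

```python
import torch
from tensordict import tensorclass

@tensorclass
class PPOLosses:
    loss_objective: torch.Tensor
    loss_critic: torch.Tensor
    loss_entropy: torch.Tensor

    def total(self) -> torch.Tensor:
        # Each term keeps its own named field (and autocompletion)
        # instead of being looked up by a string key.
        return self.loss_objective + self.loss_critic + self.loss_entropy

losses = PPOLosses(
    loss_objective=torch.tensor(1.0),
    loss_critic=torch.tensor(0.5),
    loss_entropy=torch.tensor(0.01),
    batch_size=[],
)
print(losses.total())
```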
```python
from torchrl.modules import LSTM
import torch
from torch.utils.benchmark import Timer
from torchrl.modules.tensordict_module.rnn import _get_num_per_traj_init, _split_and_pad_sequence

b = 10
t = 100
c = 32
device = "cuda"
with torch.device(device):
    ...
```
## Describe the bug

If the metric one is trying to log with the CSV logger has `/` in its name, you will get a `No such file or directory`...
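A minimal repro sketch, under the assumption that `CSVLogger` maps each metric name to a file path, so the `/` is read as a directory separator; the experiment name and log directory are placeholders.

```python
from torchrl.record.loggers import CSVLogger

logger = CSVLogger(exp_name="demo", log_dir="./logs")
logger.log_scalar("loss", 0.5)        # fine: plain file name
logger.log_scalar("train/loss", 0.5)  # FileNotFoundError: "train/" treated as a directory
```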
## Motivation

In the original MAPPO paper, the authors claim that an RNN-based actor beats the standard MLP actor; however, TorchRL currently has no recurrent multi-agent networks. Also suggested here by...
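To make the request concrete, a plain-PyTorch sketch of a recurrent actor shared across agents; nothing like this exists in torchrl yet, and the class below is purely illustrative.

```python
import torch
import torch.nn as nn

class SharedRNNActor(nn.Module):
    """One LSTM shared by all agents, applied agent-wise (parameter sharing)."""

    def __init__(self, obs_dim: int, hidden: int, act_dim: int):
        super().__init__()
        self.rnn = nn.LSTM(obs_dim, hidden, batch_first=True)
        self.head = nn.Linear(hidden, act_dim)

    def forward(self, obs: torch.Tensor) -> torch.Tensor:
        # obs: (batch, time, n_agents, obs_dim)
        b, t, n, d = obs.shape
        # Fold agents into the batch so the same LSTM processes each agent.
        flat = obs.permute(0, 2, 1, 3).reshape(b * n, t, d)
        out, _ = self.rnn(flat)
        logits = self.head(out).reshape(b, n, t, -1).permute(0, 2, 1, 3)
        return logits  # (batch, time, n_agents, act_dim)
```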
## Motivation

Currently we must instantiate the environment we wish to solve before creating a `MultiSyncDataCollector` object. This is because we can't create a policy without knowing...
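A sketch of the constraint as it stands today: the policy's layer sizes come from the env specs, so an env has to be built just to read them. The Pendulum task is an arbitrary choice; the collector call follows torchrl's documented signature.

```python
import torch.nn as nn
from tensordict.nn import TensorDictModule
from torchrl.envs.libs.gym import GymEnv
from torchrl.collectors import MultiSyncDataCollector

def make_env():
    return GymEnv("Pendulum-v1")

env = make_env()  # built only to read the specs below
policy = TensorDictModule(
    nn.Linear(
        env.observation_spec["observation"].shape[-1],
        env.action_spec.shape[-1],
    ),
    in_keys=["observation"],
    out_keys=["action"],
)
collector = MultiSyncDataCollector(
    [make_env, make_env],
    policy=policy,
    frames_per_batch=64,
    total_frames=256,
)
```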