rl issues

Tutorial of implementing learning using torchrl in a PettingZoo environment

2

Is there a tutorial on how to implement an RL algorithm like PPO in a PettingZoo env using TorchRL? I used the PettingZooWrapper() function on the already created environment but...

AnastasiaPsarou

enhancement

[Feature] Lightning integration example

8

## Description This PR offers a convenient `lightning.pytorch.LightningModule` base class, from which one can inherit to be able to train a `torchrl` model using `lightning`. ## Motivation and Context This...

svnv-svsv-jm

enhancement

CLA Signed

[Algorithm] CrossQ

2

## Description Adding [CrossQ](https://openreview.net/pdf?id=PczQtTsTIX) ## Motivation and Context Why is this change required? What problem does it solve? If it fixes an open issue, please link to the issue here....

BY571

CLA Signed

new algo

[Feature Request] Return depth from RoboHiveEnv

## Motivation Simulated environments are capable of rendering depth images which can be very useful in downstream training as they enable 3D understanding of the task being performed. RoboHive environments,...

sriramsk1999

enhancement

[Performance] Faster target update using foreach

3

vmoens

CLA Signed

[WIP] add multiagentRNN

15

## Description Per #2003, adds multi-agent GRUs and LSTMs to TorchRL's multiagent modules. Modifies the MultiAgentNetBase class to take in multiple input tensors and output tensors, which allows these recurrent...

kfu02

enhancement

CLA Signed

[Feature] Split-trajectories and represent as nested tensor

3

TODO: - [x] Doc

vmoens

enhancement

CLA Signed

[BUG] VecNorm.to_observation_norm broken for multiple keys

1

## Describe the bug The VecNorm transform produces a bug when calling the to_observation_norm method with multiple keys ## To Reproduce ``` gym_env = GymEnv("MountainCarContinuous-v0", device='cpu') transformed_env = TransformedEnv(gym_env, VecNorm(in_keys=["observation",...

maxweissenbacher

bug

Correct construction of TensorDictReplayBuffer in DDP

8

Hello, I have a question regarding the construction of replay buffers in distributed training (DDP). Across multiple workers, I would like to use a single, large prioritized replay buffer. With...

patchmeifyoucan

[BUG] Problems with BatchedEnv on accelerated device with single envs on cpu

29

## Describe the bug When the batched env device is `cuda`, the step count on the batched env seems completely off from what it should be. When the batched env...

skandermoalla

bug
