rl icon indicating copy to clipboard operation
rl copied to clipboard

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

Results 254 rl issues
Sort by recently updated
recently updated
newest added

## Description I have added the Hindsight Experience Replay Transform specifically implementing the `future` and `last` strategy as described in the paper. The transform is a combination of 3 transforms:...

enhancement
CLA Signed

## Motivation Maniskill3 is gaining a lot of taction recently and it offers great features like parallel GPU vectorized environments, a great set of tasks from simpler to complex and...

enhancement

Stack from [ghstack](https://github.com/ezyang/ghstack) (oldest at bottom): * __->__ #2763 * #2711 * #2709

CLA Signed

Stack from [ghstack](https://github.com/ezyang/ghstack) (oldest at bottom): * #2763 * __->__ #2711 * #2709

enhancement
CLA Signed

Stack from [ghstack](https://github.com/ezyang/ghstack) (oldest at bottom): * #2763 * #2711 * __->__ #2709

CLA Signed
Examples

## Motivation Currently we cannot use CUDNN based modules in loss modules as they are incompatible with vmap used in most of the losses. Particularly for RNN modules this leaves...

enhancement

## Describe the bug Maniskill3 crashes after env.rollout when transferring data to host (cuda to cpu). ```python for _ in tqdm(range(nb_iters), "Evaluation"): rollouts = self.eval_env.rollout( max_steps=self.env_max_frames_per_traj, policy=policy, auto_reset=False, auto_cast_to_device=False, tensordict=tensordict,...

bug

## Describe the bug When disabling logprob aggregation for a probabilistic actor you are supposed to pass a sequence of `log_prob_keys` as a parameter instead of a single `log_prob_key`. However,...

bug

## Motivation We should add tests for all losses with - [ ] tensors passed via `tensordict.nn.dispatch`. This should be accompanied by an example in the docstrings of how to...

enhancement
Good first issue

Stack from [ghstack](https://github.com/ezyang/ghstack) (oldest at bottom): * __->__ #2688 * #2687 * #2665

CLA Signed