rl issues

[Feature Request] Purely functional loss objectives

5

## Motivation ### 1. Consistent style for `torch.nn.modules.loss.*Loss` In `torch.nn.modules.loss`, there are many `*Loss` subclassing `nn.Module`. The `Loss.__init__()` does not takes other `nn.Module`'s as arguments. And method `Loss.forward()` method is...

XuehaiPan

enhancement

[Tutorial] Beam search with GPT models

3

Stack from [ghstack](https://github.com/ezyang/ghstack) (oldest at bottom): * #2673 * __->__ #2623

vmoens

CLA Signed

tutorials

[Feature Request] TorchRL for Python 3.13t

4

Hello, PyTorch recently added support for Python 3.13t, aka "no-gil" Python (https://github.com/pytorch/pytorch/issues/130249). Are there any plans to make TorchRL usable from this version? It would be nice if this could...

patchmeifyoucan

enhancement

[BUG] torch geometric layers not working in policy network

## Describe the bug I am trying to integrate torch geometric layers into my policy network and I think I am running into a variant of https://github.com/pytorch/rl/issues/1613 ## To Reproduce...

rerz

bug

[CI] Fix conda on windows

3

## Description Describe your changes in detail. ## Motivation and Context Why is this change required? What problem does it solve? If it fixes an open issue, please link to...

vmoens

CLA Signed

CI

[Tutorial] MCTS

3

Stack from [ghstack](https://github.com/ezyang/ghstack) (oldest at bottom): * __->__ #2673 * #2623

vmoens

CLA Signed

[BUG] SyncDataCollector Crashes when init_random_frames=0 with a policy that is NOT random

2

## Describe the bug When yielding from a `SyncDataCollector` that uses a standard `Actor` (not a random policy) and `init_random_frames=0`, it crashes. ```python policy = Actor( agent, in_keys=["your_key"], out_keys=["action"], spec=train_env.action_spec,...

AlexandreBrown

bug

[BUG] The class MultiOneHot(OneHot) uses the "to_numpy" method from the OneHot class which do not support multionehot vectors.

## Describe the bug The class `MultiOneHot(OneHot)` defined at torchrl\data\tensor_specs.py **uses the "to_numpy" method from the OneHot class.** The method "to_numpy" from the OneHot class do not support translating multione...

morales0021

bug

[Feature] habitat env from config

3

## Description Describe your changes in detail. ## Motivation and Context Why is this change required? What problem does it solve? If it fixes an open issue, please link to...

vmoens

bug

enhancement

CLA Signed

[Feature Request] ActionDiscretizer scalar integration

2

## Motivation The `ActionDiscretizer` only gives the option of converting the `input_spec["full_action_spec"]` to `MultiCategorical` or `MultiOneHot`. This introduces a dimension into the shape: ``` MultiCategorical( shape=torch.Size([1]), space=BoxList(boxes=[CategoricalBox(n=4)]), dtype=torch.int64, domain=discrete) ```...

oslumbers

enhancement

rl
rl copied to clipboard

Metadata

[Feature Request] Purely functional loss objectives

[Tutorial] Beam search with GPT models

[Feature Request] TorchRL for Python 3.13t

[BUG] torch geometric layers not working in policy network

[CI] Fix conda on windows

[Tutorial] MCTS

[BUG] SyncDataCollector Crashes when init_random_frames=0 with a policy that is NOT random

[BUG] The class MultiOneHot(OneHot) uses the "to_numpy" method from the OneHot class which do not support multionehot vectors.

[Feature] habitat env from config

[Feature Request] ActionDiscretizer scalar integration

← Metadata

Owner

Metadata

rl rl copied to clipboard

Metadata

← Metadata

Owner

Metadata

rl
rl copied to clipboard