Dimitrios Tsaras
Dimitrios Tsaras
## Motivation There is currently no TensorSpec such that it allows sequential data. For example, if an environment was supposed to represent a sentence, it would be represented as (batch_size,...
## Describe the bug There is a bug when an EnvBase has a MultiDiscreteTensorSpec and moved to mps (I have not tested yet if the issue persists with cuda). ##...
## Describe the bug There seems to be a bug with performing a rollout when the tensordicts produced do not have the same shape. The rollout used to produce LazyStackedTensorDict,...
## Description I have added the Hindsight Experience Replay Transform specifically implementing the `future` and `last` strategy as described in the paper. The transform is a combination of 3 transforms:...