Albert Bou

Results 7 issues of Albert Bou

I have seen that, by default, all instances of the obstacle tower environment I start run in the primary GPU (in my case GPU:0). Is it possible to specify in...

## Motivation When RNN’s are used in isolation, creating a TensorDictPrimer Transform for the environment to populate the TensorDicts with the expected tensors is pretty straightforward: ```python from torchrl.modules import...

enhancement

## Description Currently TorchRL has a CenterCrop transform. I have recently encountered a situation were I did not want to crop the center, but another part of the obs image....

enhancement
CLA Signed

In Python >= 3.9, `import safe` results in `ImportError: cannot import name 'gcd' from 'fractions' ` I believe it is related to the `math` package https://stackoverflow.com/questions/66174862/import-error-cant-import-name-gcd-from-fractions

## Description This PR adapts the SAC objective to be compatible with composite distributions. DiscreteSAC seems a little bit more tricky, as it requires computing the log probabilities of all...

CLA Signed

## Describe the bug I see very cool advancements in the direction of LLM RL training in the repo, awesome work! :) After playing a bit with the LLMEnv I...

bug

I am trying to speedup inference and training of a `mistralai/Mistral-Small-3.1-24B-Instruct-2503` model. Simply replacing `AutoModelForCausalLM` with `AutoLigerKernelForCausalLM` does not lead to any speedup in my sampling speed or memory usage....