Florian Bauer
Results: 2 comments by Florian Bauer
> Although it won't be the most memory-efficient solution, it's possible to compute the rewards of different actions from the environment at the same step by cloning the `env` object....
Hi @araffin, thanks for your review. > is indeed a hack that might work in some environments (those that can be reset to an exact state) and seems to...
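
The cloning idea in the first comment can be sketched roughly as follows. This is only a minimal illustration, not code from the thread: `ToyEnv` and `peek_rewards` are hypothetical names, and the sketch assumes the environment's whole state lives in plain Python objects so that `copy.deepcopy` yields an independent copy.

```python
import copy


class ToyEnv:
    """Hypothetical stand-in for an environment; not a real library class."""

    def __init__(self, position=0):
        self.position = position

    def step(self, action):
        # Move left (-1) or right (+1); the reward grows as we approach position 3.
        self.position += action
        reward = -abs(3 - self.position)
        return self.position, reward


def peek_rewards(env, actions):
    """Return {action: reward} for one step, without advancing the original env."""
    rewards = {}
    for action in actions:
        clone = copy.deepcopy(env)  # one full copy per action, hence the memory cost
        _, reward = clone.step(action)
        rewards[action] = reward
    return rewards


env = ToyEnv(position=1)
rewards = peek_rewards(env, actions=[-1, 1])
```

Each action is tried on its own copy, so the original `env` stays at the same step. As the second comment suggests, this probably only works for environments whose exact state can be captured; one backed by an external simulator or other non-copyable resources may not survive a deep copy.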