Florian Bauer

Results 2 comments of Florian Bauer

> Although, it won't be the most optimal solution memory-wise, it's possible to compute rewards of different actions from the environment at the same step by cloning the `env` object....

Hi @araffin, thanks for your review. > is indeed a hack that might work in some environments (the one that can be reset to an exact state) and seems to...