Kallinteris Andreas
Kallinteris Andreas
Your charts are wrong, for example `episodic_return` with "new reward" gets positive values, which is not possible as it is a sum of non positive values also there is no...
1. Are you using `continuing_task=True` (which is the default)? 2. Are you resetting about `termination=True`? 3. Have you experimented with other reward functions?
@onnoeberhard https://github.com/Farama-Foundation/Gymnasium-Robotics/blob/8606192735a9963d1dcc12feade037b77d9349be/gymnasium_robotics/envs/maze/maze_v4.py#L374-L381 You are correct, can you make a PR to fix it? You can use the Gymnasium/MuJoCo as reference https://gymnasium.farama.org/main/environments/mujoco/ant/#rewards
Hey, @leonasting basically yes, but also needs to be tested would you be interested on testing it and writing a PR, with a short report that shows terminal frames? @jjshoots...
@leonasting it not is very clear with this camera angle, try `camera_id=3` (argument in the `make` constructor)
@leonasting excellent I think it is clear that 1. below 0.8 the pen has fallen, and the hand can not interact with it in any way Now can you show...
@pseudo-rnd-thoughts If you have no further comments/requests, feel free to publish it. I am happy with the current version
env.unwrapped.P
https://gymnasium.farama.org/api/vector/#gymnasium.vector.VectorEnv.reset
The vector env does not support [barrier synchronization](https://en.wikipedia.org/wiki/Barrier_(computer_science)) functionality (nor do I believe it should) you can you simply serially run multiple environments