PPO Performance Degradation from 1.4.0 to 1.4.3
Description
Hey there!
Just wanted to report on something I am observing, and perhaps gain some understanding as to why this is happening.
I am using SKRL with IsaacLab:
- previously I was using SKRL 1.4.0 and an older version of IsaacLab (1.4)
- now I am using SKRL 1.4.3 with the latest version of IsaacLab
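In case it helps with reproducing this, here is a minimal sketch for printing the exact package versions in each environment. It only uses Python's standard `importlib.metadata`. The distribution names in `PACKAGES` are assumptions on my part (IsaacLab in particular may be installed under a different name depending on the release), so adjust them as needed.

```python
from importlib import metadata

# Distribution names are assumptions; IsaacLab has been packaged under
# different names across releases, so edit this list to match your install.
PACKAGES = ["skrl", "isaaclab", "torch"]


def installed_versions(names):
    """Return {name: version string, or "not installed"} for each name."""
    versions = {}
    for name in names:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = "not installed"
    return versions


if __name__ == "__main__":
    for name, version in installed_versions(PACKAGES).items():
        print(f"{name}: {version}")
```

Running this in both the old and the new environment and diffing the output should show whether anything besides SKRL and IsaacLab changed between the two runs.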
I am noticing a large difference in PPO performance on one task. Since both SKRL and IsaacLab changed between the two setups, I would love some insight into whether this difference comes from the SKRL update or from something else.
The first image (light blue line) shows the results of PPO on the task Isaac-Ant-v0 using my previous configuration (SKRL 1.4.0).
The second image (dark blue line) shows the results of PPO on the same task, Isaac-Ant-v0, using my current configuration (SKRL 1.4.3).
Looking forward to discussing this. Thanks!
P.S. If this is not an issue but rather something else, please let me know and I will be happy to change it.