RLzoo Results on Box2D environments

Open balasurajp opened this issue 4 years ago • 1 comment

I tried to benchmark the following environments ['BipedalWalker-v2', 'BipedalWalkerHardcore-v2', 'CarRacing-v0', 'LunarLander-v2', 'LunarLanderContinuous-v2'] using the ['A3C', 'DDPG', 'TD3', 'SAC', 'PG', 'TRPO', 'PPO', 'DPPO'] algorithms. Most of the combinations failed to learn the task and didn't converge. Only (SAC, LunarLanderContinuous-v2) and (TD3, LunarLanderContinuous-v2) learnt the task, and even then only sub-optimally. Can someone address this issue?
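
For anyone trying to reproduce this, here is a minimal sketch of how such an algorithm/environment grid could be enumerated. It is not the script used for the runs above. `run_grid` and `train_and_score` are hypothetical names: `train_and_score(alg, env)` stands in for whatever training and evaluation call you use (for example an RLzoo run with the default hyper-parameters) and is assumed to return a score. Failures are recorded rather than raised, so that one unsupported pairing (for instance an algorithm that cannot handle the environment's action space) does not stop the sweep.

```python
from itertools import product

# Environment and algorithm names exactly as listed in the report above.
ENVS = ['BipedalWalker-v2', 'BipedalWalkerHardcore-v2', 'CarRacing-v0',
        'LunarLander-v2', 'LunarLanderContinuous-v2']
ALGS = ['A3C', 'DDPG', 'TD3', 'SAC', 'PG', 'TRPO', 'PPO', 'DPPO']


def run_grid(train_and_score, envs=ENVS, algs=ALGS):
    """Call train_and_score(alg, env) for every pair and collect the outcomes.

    train_and_score is a hypothetical, user-supplied callable; it is an
    assumption of this sketch, not part of any library.
    """
    results = {}
    for alg, env in product(algs, envs):
        try:
            results[(alg, env)] = train_and_score(alg, env)
        except Exception as exc:
            # Record the failure and continue with the remaining pairs.
            results[(alg, env)] = f'failed: {exc!r}'
    return results


if __name__ == '__main__':
    # Dummy stand-in that trains nothing; replace it with a real run.
    demo = run_grid(lambda alg, env: None)
    print(len(demo), 'combinations')
```

Keeping the per-pair outcome in one dictionary makes it easy to report which combinations failed to run at all and which ran but did not converge.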

Feb 09 '21 13:02 balasurajp

Hi, did you use the default hyper-parameters provided in RLzoo? If so, we will look into this problem.

Jun 26 '21 02:06 quantumiracle
