pytorch-ddpg-naf icon indicating copy to clipboard operation
pytorch-ddpg-naf copied to clipboard

Implementation of algorithms for continuous control (DDPG and NAF).

Results 7 pytorch-ddpg-naf issues
Sort by recently updated
recently updated
newest added

The NAF algorithm does not work on Pendulum or any of the PyBullet environments. @ikostrikov Do you have any guesses why that might be the case? Which environments did you...

For adaptive noise estimation, need to get some states for the expectation operator and compute the ddpg distance metric between perturbed and non perturbed action but getting samples index "t"...

Hi Ilya, Thanks for your open-source implementation of DDPG/NAF in pytorch. We spotted a typo in NAF: the discount factor (and the done mask) should multiply the next_state_values instead of...

Hi, I was wondering if there is any particular reason why this repo doesn't use parallel environments like those in the a2c-ppo-acktr repo.

Could you help saving the learnt model after each updates.

Hi @ikostrikov , I appreciate your implementation, and I wonder if you've benchmarked your implementation? If so, can I have some roughly results. Many thanks!

hi ikostrikov, I got this error when running your code ``` Traceback (most recent call last): File "main.py", line 89, in agent.update_parameters(batch) File "/home/andrewliao11/Work/pytorch-naf/naf.py", line 121, in update_parameters param.grad.data.clamp(-1, 1)...