pytorch-ddpg-naf issues

NAF Implementation not working!

2

The NAF algorithm does not work on Pendulum or any of the PyBullet environments. @ikostrikov Do you have any guesses why that might be the case? Which environments did you...

Akella17

t was not defined, I assumed to be equal to batch size and computing …

For adaptive noise estimation, need to get some states for the expectation operator and compute the ddpg distance metric between perturbed and non perturbed action but getting samples index "t"...

gsp-27

fixed NAF reward discount

Hi Ilya, Thanks for your open-source implementation of DDPG/NAF in pytorch. We spotted a typo in NAF: the discount factor (and the done mask) should multiply the next_state_values instead of...

jackokaiser

Parallel OpenAI environments

Hi, I was wondering if there is any particular reason why this repo doesn't use parallel environments like those in the a2c-ppo-acktr repo.

codeislife99

saving the trained model

Could you help saving the learnt model after each updates.

svd3

benchmarking the repo

1

Hi @ikostrikov , I appreciate your implementation, and I wonder if you've benchmarked your implementation? If so, can I have some roughly results. Many thanks!

andrewliao11

AttributeError for gradient clipping

hi ikostrikov, I got this error when running your code ``` Traceback (most recent call last): File "main.py", line 89, in agent.update_parameters(batch) File "/home/andrewliao11/Work/pytorch-naf/naf.py", line 121, in update_parameters param.grad.data.clamp(-1, 1)...

andrewliao11

pytorch-ddpg-naf
pytorch-ddpg-naf copied to clipboard

Metadata

NAF Implementation not working!

t was not defined, I assumed to be equal to batch size and computing …

fixed NAF reward discount

Parallel OpenAI environments

saving the trained model

benchmarking the repo

AttributeError for gradient clipping

← Metadata

Owner

Metadata

pytorch-ddpg-naf pytorch-ddpg-naf copied to clipboard

Metadata

NAF Implementation not working!

t was not defined, I assumed to be equal to batch size and computing …

fixed NAF reward discount

Parallel OpenAI environments

saving the trained model

benchmarking the repo

AttributeError for gradient clipping

← Metadata

Owner

Metadata

pytorch-ddpg-naf
pytorch-ddpg-naf copied to clipboard