random-network-distillation-pytorch issues

Intrinsic reward calculation, sum or mean?

2

Hi! I have a question about how the intrinsic rewards are calculated. Why do you use `sum(1)` instead of `mean(1)`? https://github.com/jcwleo/random-network-distillation-pytorch/blob/e383fb95177c50bfdcd81b43e37c443c8cde1d94/agents.py#L76 That would calculate the sum along the...

aklein1995
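To make the sum-versus-mean question above concrete, here is a minimal sketch in plain Python. It assumes the intrinsic reward is the per-sample squared error between target and predictor features; the function name `intrinsic_rewards` and the toy data are hypothetical and not taken from the repository. In PyTorch the two reductions would correspond to `.pow(2).sum(1)` and `.pow(2).mean(1)` on the feature difference.

```python
def intrinsic_rewards(target, pred, reduce="sum"):
    """Per-sample squared error, reduced over the feature dimension (dim 1)."""
    rewards = []
    for t_row, p_row in zip(target, pred):
        squared = [(t - p) ** 2 for t, p in zip(t_row, p_row)]
        total = sum(squared)
        # "sum" keeps the raw total; "mean" divides by the feature count.
        rewards.append(total if reduce == "sum" else total / len(squared))
    return rewards


# Toy batch: 2 samples, 4 features each (hypothetical values).
target = [[1.0, 2.0, 3.0, 4.0], [0.0, 0.0, 0.0, 0.0]]
pred = [[0.0, 2.0, 1.0, 4.0], [1.0, 1.0, 1.0, 1.0]]

r_sum = intrinsic_rewards(target, pred, "sum")    # [5.0, 4.0]
r_mean = intrinsic_rewards(target, pred, "mean")  # [1.25, 1.0]
```

Under these assumptions the two variants differ only by a constant factor equal to the feature dimension (4 here). If the rewards are later divided by a running standard deviation, that factor would cancel out; otherwise it acts as a scale on the intrinsic reward.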

If I want to apply this work to a new env, what should I do?

Thanks for the great work! I'd like to know: if I want to apply this work to a new continuous env that I created myself, what should I do? Do you...

SOMEAIDI

training error

1

I found that the code loads pretrained weights during training. I tried to train without the pretrained weights, but something seems to go wrong. Here is my result. ![image](https://user-images.githubusercontent.com/16297710/83408215-d8305900-a444-11ea-81e7-50ef1c42d909.png)

rainbow979

About sticky action

1

Hi, in your code (`envs.py`), I saw that you first wrap the environment with `MaxAndSkipEnv()` and then apply the sticky action. However, in [the RND authors' code](https://github.com/openai/random-network-distillation), I found that...

tongzhoumu
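The ordering question above can be illustrated with a toy sketch that records which action is executed on each emulator frame. It uses no Gym wrappers; both function names, the `skip` and `p` defaults, and the initial action `0` are assumptions made for illustration, and the sketch does not claim to reproduce either repository's exact behavior.

```python
import random


def sticky_outside_skip(actions, skip=4, p=0.25, seed=0):
    """Sticky decision once per agent step; the result is repeated for `skip` frames."""
    rng = random.Random(seed)
    last = 0  # assumed initial action
    frames = []
    for a in actions:
        if rng.random() < p:
            a = last  # repeat the previous agent-level action
        last = a
        frames.extend([a] * skip)
    return frames


def sticky_inside_skip(actions, skip=4, p=0.25, seed=0):
    """Sticky decision at every emulator frame inside the skip loop."""
    rng = random.Random(seed)
    last = 0  # assumed initial action
    frames = []
    for a in actions:
        for _ in range(skip):
            if rng.random() >= p:
                last = a  # switch to the new action on this frame
            frames.append(last)
    return frames


agent_actions = [1, 2, 3, 1]
outer = sticky_outside_skip(agent_actions)
inner = sticky_inside_skip(agent_actions)
```

With the sticky step outside the frame skip, every block of `skip` frames carries a single action, so a "stuck" action persists for a whole agent step. With it inside, the action can change partway through a block, which makes the stochasticity finer-grained.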

How long did it take you to reach 6100?

8

Hello, I also built an RND model, but I am stuck at 2500... How many total steps does it take before the agent improves further? I am not sure whether it is related to a bug...

zhr211

jcwleo

Add RNN model

jcwleo

enhancement

Action values are incremented by 1 for the Breakout game?

Hi, is the reason the following code modifies the actions for the Breakout game to eliminate the NOOP action from the available set of actions that can be taken...

cangozpi
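As a sketch of what such an offset would do, the snippet below maps a policy output index to an environment action index by adding 1, which skips index 0 (NOOP) in Breakout's minimal action set. The helper `to_env_action` and the assumption that the policy has three outputs are hypothetical; the truncated question does not show the repository's actual code.

```python
# Breakout's minimal action set in the Arcade Learning Environment.
BREAKOUT_ACTIONS = ["NOOP", "FIRE", "RIGHT", "LEFT"]


def to_env_action(policy_action: int, offset: int = 1) -> int:
    """Shift a policy index by `offset` so that index 0 (NOOP) is never selected."""
    env_action = policy_action + offset
    if policy_action < 0 or env_action >= len(BREAKOUT_ACTIONS):
        raise ValueError(f"policy action {policy_action} is out of range")
    return env_action


# Assumption: the policy head has 3 outputs (indices 0, 1, 2).
reachable = [BREAKOUT_ACTIONS[to_env_action(a)] for a in range(3)]
# reachable == ["FIRE", "RIGHT", "LEFT"]
```

Under that assumption the agent can never idle, which shrinks the action space at the cost of removing a legitimate "do nothing" choice.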


Reduce memory usage (2-3x times)

global_grad_norm_ has no effect

README asset
