random-network-distillation-pytorch icon indicating copy to clipboard operation
random-network-distillation-pytorch copied to clipboard

Random Network Distillation pytorch

Results 11 random-network-distillation-pytorch issues
Sort by recently updated
recently updated
newest added
trafficstars

Hi! I have a question related to how the intrinsic rewards are calculated. Why do you use the sum(1) instead of mean(1)? https://github.com/jcwleo/random-network-distillation-pytorch/blob/e383fb95177c50bfdcd81b43e37c443c8cde1d94/agents.py#L76 That would calculate the sum along the...

Tanks for the great work! I'd like to konw if i want employe this work on the new continuous env which is created by myself, what should i do?Do you...

I find the code loads the pretrained weights in training. I tried to train without pretrained weight. But it seems a wrong operations. There is my result. ![image](https://user-images.githubusercontent.com/16297710/83408215-d8305900-a444-11ea-81e7-50ef1c42d909.png)

Hi, In your code (envs.py), I saw that you first use MaxAndSkipEnv() to wrap the environment, and then apply the sticky action. However, in [RND's author's code](https://github.com/openai/random-network-distillation), I found that...

Hello, I also built RND model, but I am stuck at 2500... How many total steps will agent improve further? I am not sure whether it is related to bug...

![image](https://user-images.githubusercontent.com/23333028/48664090-cf836680-eadc-11e8-969b-5201db99907d.png)

Hi, Is the reason for the following code modifying the actions for the breakout game is eliminating the NOOP action from the available set of actions that can be taken...