random-network-distillation-pytorch
random-network-distillation-pytorch copied to clipboard
Random Network Distillation pytorch
Hi! I have a question related to how the intrinsic rewards are calculated. Why do you use the sum(1) instead of mean(1)? https://github.com/jcwleo/random-network-distillation-pytorch/blob/e383fb95177c50bfdcd81b43e37c443c8cde1d94/agents.py#L76 That would calculate the sum along the...
Tanks for the great work! I'd like to konw if i want employe this work on the new continuous env which is created by myself, what should i do?Do you...
I find the code loads the pretrained weights in training. I tried to train without pretrained weight. But it seems a wrong operations. There is my result. 
Hi, In your code (envs.py), I saw that you first use MaxAndSkipEnv() to wrap the environment, and then apply the sticky action. However, in [RND's author's code](https://github.com/openai/random-network-distillation), I found that...
Hello, I also built RND model, but I am stuck at 2500... How many total steps will agent improve further? I am not sure whether it is related to bug...
README asset

Hi, Is the reason for the following code modifying the actions for the breakout game is eliminating the NOOP action from the available set of actions that can be taken...