random-network-distillation-pytorch

Published 20 hours ago •


Random Network Distillation in PyTorch


Random Network Distillation

Intrinsic Reward Graph with play

Venture	Montezuma's Revenge

~	New model for Montezuma

[x] Advantage Actor critic [1]
[x] Parallel Advantage Actor critic [2]
[x] Exploration by Random Network Distillation [3]
[x] Proximal Policy Optimization Algorithms [4]
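As a rough illustration of the Random Network Distillation idea from [3], the sketch below trains a predictor network to match a fixed, randomly initialised target network and uses the prediction error as an intrinsic reward. The class name `RNDSketch`, the layer sizes, and the toy observations are assumptions made for this example; this is not the model used in this repository.

```python
import torch
import torch.nn as nn


class RNDSketch(nn.Module):
    """Hypothetical minimal RND module (illustration only, not the repo's model)."""

    def __init__(self, obs_dim: int, feat_dim: int = 32):
        super().__init__()
        # Target network: randomly initialised and never trained.
        self.target = nn.Sequential(
            nn.Linear(obs_dim, 64), nn.ReLU(), nn.Linear(64, feat_dim)
        )
        # Predictor network: trained to reproduce the target's features.
        self.predictor = nn.Sequential(
            nn.Linear(obs_dim, 64), nn.ReLU(), nn.Linear(64, feat_dim)
        )
        for p in self.target.parameters():
            p.requires_grad_(False)

    def intrinsic_reward(self, obs: torch.Tensor) -> torch.Tensor:
        # Per-sample mean squared prediction error; it shrinks for
        # observations the predictor has already been trained on.
        with torch.no_grad():
            target_feat = self.target(obs)
        pred_feat = self.predictor(obs)
        return (pred_feat - target_feat).pow(2).mean(dim=1)


torch.manual_seed(0)
rnd = RNDSketch(obs_dim=8)
optimizer = torch.optim.Adam(rnd.predictor.parameters(), lr=1e-3)

seen = torch.randn(64, 8)  # toy stand-in for a batch of visited observations
reward_before = rnd.intrinsic_reward(seen).mean().item()

for _ in range(200):
    # The predictor's training loss is the same error used as the reward.
    loss = rnd.intrinsic_reward(seen).mean()
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()

reward_after = rnd.intrinsic_reward(seen).mean().item()
print(reward_before, reward_after)
```

After training on the same batch, the intrinsic reward for those observations drops, which is what pushes an agent toward states it has not visited yet.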

1. Setup

Requirements

2. How to Train

Modify the parameters in config.conf as you like.

python train.py
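For orientation, a file such as config.conf can be read with Python's standard `configparser` module. The section name and the keys below (`EnvID`, `LearningRate`, `NumEnv`, `UseGPU`) are hypothetical placeholders chosen for this sketch; check the actual config.conf for the options it really defines.

```python
import configparser

# Hypothetical contents; the real config.conf may use different sections and keys.
EXAMPLE_CONF = """
[DEFAULT]
EnvID = MontezumaRevengeNoFrameskip-v4
LearningRate = 0.0001
NumEnv = 32
UseGPU = True
"""

parser = configparser.ConfigParser()
parser.read_string(EXAMPLE_CONF)  # use parser.read("config.conf") for a file on disk
settings = parser["DEFAULT"]

# Values are stored as strings; the typed getters convert them.
env_id = settings["EnvID"]
learning_rate = settings.getfloat("LearningRate")
num_env = settings.getint("NumEnv")
use_gpu = settings.getboolean("UseGPU")

print(env_id, learning_rate, num_env, use_gpu)
```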

3. How to Evaluate

python eval.py

4. Loss/Reward Graph

Montezuma's Revenge Env
Venture Env

References

[1] Actor-Critic Algorithms
[2] Efficient Parallel Methods for Deep Reinforcement Learning
[3] Exploration by Random Network Distillation
[4] Proximal Policy Optimization Algorithms

