[How-to] Learning by Playing – Solving Sparse Reward Tasks from Scratch
I'm new to Coach and would appreciate some advice on how to approach implementing Learning by Playing – Solving Sparse Reward Tasks from Scratch
(https://arxiv.org/abs/1802.10567) using multiple heads, since they are supported.
I am just starting to play with Coach and am trying to tackle an interesting task.
Any help would be welcome.