rainbow
Add Prioritized Experience Replay for Rainbow implementation
Should we work on adding Prioritized Experience Replay?
Yeah, you are more than welcome to implement that. I had no access to a GPU until recently and have only just gotten one. But I personally may not give this high priority, since a2c (https://github.com/hengyuan-hu/a2c) learns much faster.
Why does a2c learn much faster? Should rainbow be the fastest overall?
Hi @hengyuan-hu, are you willing to merge the request on PER? Or would you like us to conduct more experiments first?