policy-gradient-pong
Memory Leak?
First of all, thanks for this! It's a really nice re-implementation of Karpathy's original code.
The one issue I've been having is that memory usage climbs steadily over long runs: it was around 4GB after ~3 hours and over 8GB after about 8 hours.
It could just be me, but I was wondering if you have any idea what could be causing this?
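In case it helps narrow things down, here is a rough sketch of how the growth could be tracked per episode using Python's standard `tracemalloc` module. Note that `run_episode` and `measure_growth` are hypothetical names I made up for illustration, not functions from this repo, and the "leak" here is simulated by a list that is never cleared. `tracemalloc` only sees memory allocated through Python's allocator, so a leak in native code might not show up this way.

```python
import tracemalloc


def run_episode(buffer):
    # Hypothetical stand-in for one training episode. It deliberately
    # appends to a list that is never cleared, to simulate a leak.
    buffer.append([0.0] * 10_000)


def measure_growth(num_episodes=50, report_every=10):
    """Return (samples, top_stats) for a run of simulated episodes.

    samples is a list of (episode, traced_bytes) pairs taken every
    `report_every` episodes; top_stats lists the largest allocation sites.
    """
    tracemalloc.start()
    leaked = []
    samples = []
    for episode in range(1, num_episodes + 1):
        run_episode(leaked)
        if episode % report_every == 0:
            current, _peak = tracemalloc.get_traced_memory()
            samples.append((episode, current))
    # Largest allocation sites, grouped by source line.
    top_stats = tracemalloc.take_snapshot().statistics("lineno")[:3]
    tracemalloc.stop()
    return samples, top_stats


if __name__ == "__main__":
    samples, top_stats = measure_growth()
    for episode, current in samples:
        print(f"episode {episode}: {current / 1e6:.1f} MB traced")
    for stat in top_stats:
        print(stat)
```

If the traced size keeps rising between reports in the real training loop, the top entries in the snapshot should point at the lines doing the allocating.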
Thanks!