async-rl

[WIP] Fixups

Open BlGene opened this issue 9 years ago • 3 comments

This is a collection of my fixups.

BlGene avatar Jun 23 '16 00:06 BlGene

Hi,

Thanks for posting this!

I was wondering if you were going to push an updated version? I would be interested to know whether running on a GPU substantially increases performance for Atari. Also, is running 36 processes always better than running 16? I don't recall the paper addressing this.

(If you have any ideas for improving the code that you haven't had time to try yourself, I would be interested in hearing them.)

BR, Max

BlGene avatar Jun 23 '16 22:06 BlGene

Thanks for the nice fixes! I'll merge them after checking.

> I was wondering if you were going to push an updated version?

Yes, I have done some refactoring and implemented training for gym environments and continuous tasks (not so successful so far), but I haven't had enough time to push them.

> I would be interested to know whether running on a GPU substantially increases performance for Atari.

I'm also interested in that, but using a GPU would be tricky for my multi-process implementation. I suppose there's no way to share GPU memory among different processes.

> Also, is running 36 processes always better than running 16? I don't recall the paper addressing this.

I didn't compare the scores of 16 vs 36 processes. My implementation is apparently slower than DeepMind's, so in order to complete the same number of training steps in one day I needed to use more processes.

muupan avatar Jun 24 '16 08:06 muupan

Hi @muupan,

I updated the PR to fix the problem that all processes were starting with the same random number (and a bit more).

BR, Max

BlGene avatar Jul 08 '16 23:07 BlGene
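
For readers unfamiliar with the seeding problem mentioned in the last comment: when every worker process initializes its random number generator identically, all workers draw the same numbers and explore in lockstep. The thread does not show how the PR addresses this, so the following is only a minimal sketch of one common approach, assuming each worker knows its own index. The names `derive_seed`, `make_worker_rng`, and `base_seed` are hypothetical and do not come from the async-rl code.

```python
import random


def derive_seed(base_seed, process_idx):
    """Hypothetical helper: offset a shared base seed by the worker index
    so that every worker gets a distinct, reproducible seed."""
    return base_seed + process_idx


def make_worker_rng(base_seed, process_idx):
    """Build a private generator for one worker.

    In a multi-process trainer this would be called once at the start of
    each worker's target function (an assumption, not the PR's code).
    """
    return random.Random(derive_seed(base_seed, process_idx))


def first_draws(rng, n=3):
    """Return the first n integers drawn from rng, for comparison."""
    return [rng.randrange(1000) for _ in range(n)]


if __name__ == "__main__":
    base_seed = 0
    # Simulate four workers; each prints a different sequence.
    for idx in range(4):
        print(idx, first_draws(make_worker_rng(base_seed, idx)))
```

Deriving the seeds from one base value keeps a whole run reproducible while still giving each worker its own random stream. If NumPy is used for sampling, its generator would need to be seeded per process in the same way.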