rl_a3c_pytorch plot rewards as a function of number of timesteps

plot rewards as a function of number of timesteps

Open rachit-dubey opened this issue 5 years ago • 1 comments

Hi, thanks so much for the excellent codebase. Just wondering, is there any way to plot the training curve as a function of timesteps (as opposed to plotting the training curve as a function of time passed)?

Thanks!

Oct 04 '18 16:10 rachit-dubey

As the training cases are in asynchronous manner. If you need such data and want accurate number and not hinder training performance then probably best to keep a tally in each parallel process. Then set up to combine total by sending that data to a newly created process that would do the actual summation of.

Nov 01 '18 19:11 dgriff777

rl_a3c_pytorch rl_a3c_pytorch copied to clipboard

plot rewards as a function of number of timesteps

rl_a3c_pytorch
rl_a3c_pytorch copied to clipboard