rl_a3c_pytorch icon indicating copy to clipboard operation
rl_a3c_pytorch copied to clipboard

plot rewards as a function of number of timesteps

Open rachit-dubey opened this issue 5 years ago • 1 comments

Hi, thanks so much for the excellent codebase. Just wondering, is there any way to plot the training curve as a function of timesteps (as opposed to plotting the training curve as a function of time passed)?

Thanks!

rachit-dubey avatar Oct 04 '18 16:10 rachit-dubey

As the training cases are in asynchronous manner. If you need such data and want accurate number and not hinder training performance then probably best to keep a tally in each parallel process. Then set up to combine total by sending that data to a newly created process that would do the actual summation of.

dgriff777 avatar Nov 01 '18 19:11 dgriff777