pureples
pureples copied to clipboard
Question about run_hyper()
Hi, first of all thank you for your library, it's great!
I am going through the code trying to understand what each step does, regarding the pole balancing environment. There is a point that really leaves me confused:
in run_hyper(), it seems we create the population and test it for one trial, then again for 10 trials, and then for max_trials trials.
Any reason to do that?
Thanks
Hi - thanks!
Could you specify the file and line? There is a difference between trials and episodes if that's what's confusing.