rlberry Default value for eval

Default value for eval_horizon

Open TimotheeMathieu opened this issue 2 years ago • 7 comments

Should the default for eval_horizon be 500 ?

Jul 12 '23 13:07 TimotheeMathieu

I'd keep it as large as possible (as now) and put a time limit in the environment, if necessary. To avoid hiding these choices from the user.

Jul 13 '23 12:07 omardrwch

Yes it should be 500 but could be change, it does not really matter.

Jul 13 '23 15:07 KohlerHECTOR

Yes it should be 500 but could be change, it does not really matter.

Why should this be 500? @KohlerHECTOR

Jul 14 '23 14:07 omardrwch

This should be 500 because 500 is the default for all control gym environment and is used in most benchmarks of control environments. This may be a deep rl thing. I think that there is no default in tabular rl so I think it is best to just go with the default that exists in deep rl.

Jul 14 '23 15:07 TimotheeMathieu

If the gym environment has already a time limit (at 500), any eval_horizon > 500 will do the job. So I'd keep as large as possible by default. Some atari environments have pretty huge horizons (~30k).

Jul 14 '23 15:07 omardrwch

@omardrwch Sorry for the authoritarian closing. I think indeed 500 is some kind of industry standard let us say. But in any case, this could be changed by the user when they code their experiments. Plus evaluation is pretty costly so on the contrary I would keep it as low as possible :) I guess in a dream world, we would have some config files with suggested values for n_steps n_evals eval_horizonfor different envs :)

Jul 17 '23 09:07 KohlerHECTOR

No worries! Ok for 500, but then let's put warning if we've reached 500 and the episode is not terminated.

Jul 17 '23 09:07 omardrwch

rlberry rlberry copied to clipboard

Default value for eval_horizon

rlberry
rlberry copied to clipboard