marltoolbox Upgrade to use RLLib master (almost v1.3).

Upgrade to use RLLib master (almost v1.3).

Open Manuscrit opened this issue 3 years ago • 0 comments

Remove use lock_replay during training (must not use it in LTFT). Create submodule marltoolbox.utils.log. Move methods to summarize a model into an helper class. use before_init_loss instead of after_init (policy class factory arg).

Apr 15 '21 13:04 Manuscrit

marltoolbox marltoolbox copied to clipboard

Upgrade to use RLLib master (almost v1.3).

marltoolbox
marltoolbox copied to clipboard