marltoolbox
Upgrade to use RLlib master (almost v1.3).
Remove the use of lock_replay during training (it must not be used in LTFT). Create the submodule marltoolbox.utils.log. Move the methods that summarize a model into a helper class. Use before_init_loss instead of after_init (policy class factory argument).