Adam Gleave comments

Results 172 comments of


                                            Adam Gleave

Add support for saving videos of policies on a environment for evaluation during and after training

Assigning to @samuelarnesen to get PR https://github.com/HumanCompatibleAI/imitation/pull/524 over finish line once he starts

Consistent interface for algorithms & single training script

> We should probably have an ingredient for each algorithm. This makes constructing more complex experiments (such as warm-starting) easier. Note `sacred.Experiment` is-a `sacred.Ingredient` (parent class), so I think we...

Consistent interface for algorithms & single training script

> E.g. right now we use the `train_imitation` to train `BC` as well as `DAgger`. If I am only interested in training `BC`, then I don't want to be bothered...

Consistent interface for algorithms & single training script

Thanks for clarifying @thequilo -- good to know someone else in your lab is likely to pick it up. And good luck wrapping up your PhD!

Consistent interface for algorithms & single training script

I'll try to look at this next week. @qxcv any thoughts on above, I think you've used Sacred a fair bit?

Consistent interface for algorithms & single training script

I'm in favor of trying to port one script to Hydra to try it out. Eliminating `parallel.py` would be nice, I wrote that as a temporary hacky solution and somehow...

Benchmark and replicate algorithm performance

@ernestum `benchmarking/util.py` cleans up the configs generated by automatic hyperparameter tuning; the output of this is the JSON config files in `benchmarking/`. My understanding of the outstanding issues are: -...

Benchmark and replicate algorithm performance

> Is there interest in this or is that a lower priority thing? Cleaning up and documenting `utils.py` seems worthwhile. Documenting how to generate the summary table also worthwhile. Although...

Inconsistent naming of expert demonstrations

Not sure how I missed this the first time around. I agree we should standardize, but some care needed here, as each of these names means a different thing or...

Inconsistent naming of expert demonstrations

> So I propose: > > 1. Decide if we want to call it **episodes**, **trajectories** or **rollouts**? Maybe look at the naming conventions of SB3. OTOH I'd vote for...