mlsh
Code for the paper "Meta-Learning Shared Hierarchies"
The code for the `FourRooms` env appears to have been carried over from the option-critic implementation, so it also inherits a bug from that implementation. See: https://github.com/jeanharb/option_critic/issues/11 Also,...
Hi, I read the paper and in the experiment section, apart from the first simple examples where it is trivial to determine the number of sub-policies, from section 6.4 (ant...
The total update counts differ from process to process, which causes the MPI communication to lock up. To avoid this, the update counts are now unified across processes.
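A minimal sketch of the idea behind this fix, not the actual patch: if each process runs one collective gradient exchange per update, every process has to run the same number of updates, or the ones with more updates wait forever. The function names and sample sizes below are hypothetical, and the ranks are simulated in plain Python rather than with real MPI calls.

```python
# Sketch only: simulates several MPI ranks in plain Python.
# With mpi4py, the agreement step could be written as
# comm.allreduce(local_count, op=MPI.MIN); using it here is an assumption,
# not something taken from the repository.


def local_update_count(num_samples, batch_size):
    """Updates one process would run on its own: one per full minibatch."""
    return num_samples // batch_size


def unified_update_count(per_process_counts):
    """Largest count every process can complete: the minimum across processes."""
    return min(per_process_counts)


samples_per_process = [1000, 900, 970]  # hypothetical rollout sizes per rank
batch_size = 64

local_counts = [local_update_count(n, batch_size) for n in samples_per_process]
shared_count = unified_update_count(local_counts)

# Every rank now loops exactly shared_count times, so each collective call
# is matched on all ranks and none of them blocks waiting for a peer.
print(local_counts, shared_count)
```

Taking the minimum discards a few samples on the ranks that collected more; padding every rank up to the maximum would be an alternative with a different trade-off.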
https://github.com/openai/mlsh/blob/58f527ab7e3397eeb723a7309852b6d8791d5c24/mlsh_code/rollouts.py#L123 Hi, shouldn't the logic for determining terminal states for sub-policies consider the case where the master action changes? If the action changes, shouldn't we designate the current state as...
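To make the question concrete, here is a small sketch of the behaviour the issue proposes, not the behaviour of `rollouts.py`: a sub-policy segment is marked terminal when the environment episode ends or when the master policy picks a different sub-policy for the next step. The function name `subpolicy_terminals` and the example inputs are hypothetical.

```python
def subpolicy_terminals(env_dones, master_actions):
    """Mark step t terminal for the active sub-policy if the episode ends at t
    or the master action at t + 1 differs from the one at t.

    This implements the suggestion raised in the issue; it is an assumption,
    not the logic used by the repository.
    """
    if len(env_dones) != len(master_actions):
        raise ValueError("env_dones and master_actions must have equal length")

    terminals = []
    last = len(env_dones) - 1
    for t, done in enumerate(env_dones):
        # The final step has no successor, so only the env flag applies.
        switched = t < last and master_actions[t + 1] != master_actions[t]
        terminals.append(bool(done) or switched)
    return terminals


# Hypothetical rollout: the master switches from sub-policy 0 to 1 after step 1.
env_dones = [False, False, False, True]
master_actions = [0, 0, 1, 1]
print(subpolicy_terminals(env_dones, master_actions))
```

With these flags, value bootstrapping for a sub-policy would stop at the step where it loses control instead of continuing into states visited by another sub-policy.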
Just a small correction
Bumps [scipy](https://github.com/scipy/scipy) from 0.17.1 to 1.10.0. Release notes (sourced from scipy's releases): SciPy 1.10.0 Release Notes. SciPy 1.10.0 is the culmination of 6 months of hard work. It contains many...