mlsh issues

Bug in the FourRooms env implementation

It seems the code for the `FourRooms` env was carried over from the option-critic implementation, so it inherits a bug from that implementation. See: https://github.com/jeanharb/option_critic/issues/11 Also,...

ankeshanand

How to determine the number of sub-policies?

Hi, I read the paper and in the experiment section, apart from the first simple examples where it is trivial to determine the number of sub-policies, from section 6.4 (ant...

kwea123

Unified update counts on each process to avoid locking on MPI communi…

The total update count differs from process to process, which causes MPI communication to lock. To avoid this, the update counts are unified across processes.

natsuki14
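
To illustrate the locking problem described above: if each process derives its own number of updates from its local sample count, processes that finish early stop calling the collective operation while the others wait forever. One possible fix, sketched below, is to agree on a single count before the update loop starts. The function name `unified_update_count` and the choice of the minimum are assumptions for illustration; the change proposed in this PR may unify the counts differently.

```python
def unified_update_count(samples_per_process, batch_size):
    """Return a single update count that every process can safely use.

    samples_per_process: number of samples collected by each process
                         (hypothetical input, gathered from all ranks).
    batch_size:          samples consumed per update.
    """
    if batch_size <= 0:
        raise ValueError("batch_size must be positive")
    if not samples_per_process:
        raise ValueError("need at least one process")

    # Each process could run this many updates on its own data.
    local_counts = [n // batch_size for n in samples_per_process]

    # Taking the minimum guarantees no process runs out of data while
    # the others are still waiting inside a collective call.
    return min(local_counts)


# Three processes with different amounts of data.
count = unified_update_count([128, 90, 200], batch_size=32)
print(count)
```

In an actual MPI program the same agreement could be reached with a min-reduction over the local counts, for example `comm.allreduce(local_count, op=MPI.MIN)` in mpi4py, so that every rank enters the update loop with the same bound.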

Terminal states logic for sub-policies

https://github.com/openai/mlsh/blob/58f527ab7e3397eeb723a7309852b6d8791d5c24/mlsh_code/rollouts.py#L123 Hi, shouldn't the logic for determining terminal states for sub-policies consider the case where the master action changes? If the action changes, shouldn't we designate the current state as...

ysaibhargav
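
The rule this question proposes can be sketched as follows: a step counts as terminal for the sub-policy if the environment episode ends there, or if the master policy selects a different sub-policy at the next step. This is only an illustration of the suggestion in the issue, not the logic currently in `rollouts.py`; the function and argument names are hypothetical.

```python
def subpolicy_terminals(master_actions, env_dones):
    """Mark the steps at which the active sub-policy's segment ends.

    master_actions: sub-policy index chosen by the master at each step.
    env_dones:      environment 'done' flag at each step.
    """
    if len(master_actions) != len(env_dones):
        raise ValueError("inputs must have the same length")

    last = len(master_actions) - 1
    terminals = []
    for t, done in enumerate(env_dones):
        # The master switches if the next step uses a different sub-policy.
        switched = t < last and master_actions[t + 1] != master_actions[t]
        terminals.append(bool(done) or switched)
    return terminals


flags = subpolicy_terminals(
    master_actions=[0, 0, 1, 1, 1],
    env_dones=[False, False, False, False, True],
)
print(flags)
```

With flags like these, a value bootstrap for sub-policy 0 would stop at step 1 rather than continuing into the steps controlled by sub-policy 1. Whether that is the desired behavior is exactly what the issue asks.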

Add .DS_Store and *.pyc files to .gitignore and remove them

Small correction

Bump scipy from 0.17.1 to 1.10.0 in /gym
