HRL
--policy should ignore everything after the underscore (_),
so that we only need to specify the experiment number; the weights version is already handled this way and ignores that suffix.
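
A minimal sketch of the behaviour described above, assuming the option is parsed with Python's argparse. The helper name experiment_number, the function parse_args, and the example values such as "12_baseline" are hypothetical and only illustrate the idea; the real parsing code may look different.

```python
import argparse


def experiment_number(policy: str) -> str:
    """Return the part of the value before the first underscore.

    Hypothetical helper: "12_baseline" -> "12", "12" -> "12".
    """
    return policy.split("_", 1)[0]


def parse_args(argv=None):
    # Hypothetical parser: only --policy is shown here.
    parser = argparse.ArgumentParser()
    # argparse calls the `type` callable on the raw string, so the
    # suffix after "_" is dropped before the value is stored.
    parser.add_argument("--policy", type=experiment_number, required=True)
    return parser.parse_args(argv)


if __name__ == "__main__":
    args = parse_args()
    print(args.policy)
```

With this in place, passing --policy 12 and --policy 12_baseline would select the same experiment.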