option-critic-pytorch icon indicating copy to clipboard operation
option-critic-pytorch copied to clipboard

Biased gradients

Open manuel-delverme opened this issue 4 years ago • 1 comments

you need to re-evaluate the features/state after the optimization step optim.step() because that updates the feature layer hence the features themselves

manuel-delverme avatar Jun 29 '21 04:06 manuel-delverme

Hey @manuel-delverme. Do you mean moving lines 112-114

state = option_critic.get_state(to_tensor(next_obs))
option_termination, greedy_option = option_critic.predict_option_termination(state, current_option)

following the optimisation step?

lweitkamp avatar Jun 29 '21 06:06 lweitkamp

Fixed as of the latest commit

lweitkamp avatar Aug 18 '22 19:08 lweitkamp