option-critic-pytorch Biased gradients

Open manuel-delverme opened this issue 4 years ago • 1 comments

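You need to re-evaluate the features/state after the optimization step (`optim.step()`), because that step updates the feature layer and hence the features themselves. A state computed before the step comes from the old weights, so reusing it afterwards biases the gradients.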
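A minimal sketch of the effect, not taken from the repository: a toy `nn.Linear` stands in for the feature layer, and the names `features`, `obs`, and the squared-output loss are assumptions made purely for illustration. It shows that the state computed before `optim.step()` no longer matches what the updated layer produces for the same observation.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# Hypothetical stand-in for the feature layer of the option-critic network.
features = nn.Linear(4, 3)
optim = torch.optim.SGD(features.parameters(), lr=0.1)

# Hypothetical observation; in the real loop this would come from the environment.
obs = torch.randn(1, 4)

# State computed BEFORE the optimisation step.
state_before = features(obs)

# Placeholder loss (an assumption), just to produce a gradient on the feature layer.
loss = state_before.pow(2).sum()
optim.zero_grad()
loss.backward()
optim.step()  # updates the feature layer's weights

# State re-evaluated AFTER the step, from the same observation.
state_after = features(obs)

# The pre-step state is now stale: it reflects the old weights.
stale = state_before.detach()
max_drift = (state_after.detach() - stale).abs().max().item()
print("features changed after optim.step():", max_drift > 0)
```

Any later computation that reuses `state_before` is therefore working with features the current network would not produce.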

Jun 29 '21 04:06 manuel-delverme

Hey @manuel-delverme. Do you mean moving lines 112-114

state = option_critic.get_state(to_tensor(next_obs))
option_termination, greedy_option = option_critic.predict_option_termination(state, current_option)

following the optimisation step?
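The reordering being asked about could look roughly like the sketch below. Only `get_state`, `predict_option_termination`, `to_tensor`, `next_obs`, and `current_option` appear in this thread; `ToyOptionCritic`, its layers, the placeholder loss, and the observation values are hypothetical stand-ins so the snippet runs on its own, and they are not the repository's actual implementation.

```python
import torch
import torch.nn as nn


class ToyOptionCritic(nn.Module):
    """Hypothetical stand-in; only the two method names come from the thread."""

    def __init__(self, obs_dim=4, feat_dim=8, num_options=2):
        super().__init__()
        self.features = nn.Sequential(nn.Linear(obs_dim, feat_dim), nn.ReLU())
        self.Q = nn.Linear(feat_dim, num_options)
        self.terminations = nn.Linear(feat_dim, num_options)

    def get_state(self, obs):
        # The state is the output of the (trainable) feature layer.
        return self.features(obs)

    def predict_option_termination(self, state, current_option):
        # Sample whether the current option terminates, and pick the greedy option.
        termination_prob = self.terminations(state)[:, current_option].sigmoid()
        option_termination = bool(torch.bernoulli(termination_prob.detach()).item())
        greedy_option = self.Q(state).argmax(dim=-1).item()
        return option_termination, greedy_option


def to_tensor(obs):
    # Hypothetical helper: list of floats -> batch of one observation.
    return torch.as_tensor(obs, dtype=torch.float32).unsqueeze(0)


torch.manual_seed(0)
option_critic = ToyOptionCritic()
optim = torch.optim.SGD(option_critic.parameters(), lr=0.1)
current_option = 0
next_obs = [0.1, -0.2, 0.3, 0.4]  # made-up observation

# 1. Optimisation step first (placeholder loss, not the real option-critic loss).
loss = option_critic.Q(option_critic.get_state(to_tensor(next_obs))).pow(2).sum()
optim.zero_grad()
loss.backward()
optim.step()

# 2. Only afterwards compute the state, so it reflects the updated feature layer.
state = option_critic.get_state(to_tensor(next_obs))
option_termination, greedy_option = option_critic.predict_option_termination(state, current_option)
```

With this ordering, the `state` carried into the next iteration is always produced by the current weights.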

Jun 29 '21 06:06 lweitkamp

Fixed as of the latest commit

Aug 18 '22 19:08 lweitkamp
