CQL
CQL copied to clipboard
cannot reproduce results of adroit task hammer-cloned, relocate-human and relocate-cloned
I choose lagrange_thresh=5, policy_lr=3e-5, min_q_version=2, min_q_weight=1, max_q_backup=False
But only get -100 on hammer-cloned, 4 on relocate-human, -18 on relocate-cloned.
The paper reports 730, 14, and -4 separately. Do I miss other details?
Have you find the hyper-parameters to reproduce the result in Adroit?