Jiawei Gao
Jiawei Gao
Met the same problem... In my case, i checked my result in "pretrain_q.csv", and found it seem like the offline_training procedure didn't actually happen... I'm looking closely into the source...
This is my result for HalfCheetah, as you noted, "it learned nothing".  While the result shown in the paper looks like this:  I noticed that when creating the...
I have the same question here ...