Can't reproduce the results of paper.

Open wenzhoulyu opened this issue 3 years ago • 0 comments

I find something strange in the results of the code. My results of navigation are as follows. The hyper-parameters are same with yours (Num_Agents =5).

3AA0E43E-8901-49A4-9436-5D08A49AA57E Besides, I have a question for the BCF. I see your only evaluate the ensemble policy during training. Do we need the prior controller to do evaluation ? Or just use the ensemble policy to give action?

Sep 03 '22 10:09 wenzhoulyu