Katsuki Ohto
Katsuki Ohto
VTrace looks working well in experiments with Geister.
Hi, @Jogima-cyber ! Long time no see, and thanks for your great suggestion! I also tried the multiple action situation this year, so I'll check if it can be put...
@Jogima-cyber I pushed **feature/multi_unit** branch into my fork. https://github.com/DeNA/HandyRL/compare/master...YuriCat:HandyRL:feature/multi_unit In this branch, each agent can output `env.num_units()` actions in each turn. I've confirmed that it works with TicTacToe!
@Jogima-cyber > unit_mask Indeed! We need `unit_mask` if there are absent units.
@Jogima-cyber `.unsqueeze(-1)` may be necessary for either of the two.
@Jogima-cyber Updated my branch [feature/numti_unit](https://github.com/DeNA/HandyRL/compare/master...YuriCat:HandyRL:feature/multi_unit).