RL-Learner-Lucky comments

Repositories
Issues
Comments

Results 4 comments of


                                            RL-Learner-Lucky

add PPO-HER

i can implenment it on dqn sucessfully,but failed with ppo

How can I train it on multi-GPU

`temp_grad2=[g + 0 for g in temp_grads] ` because temp_grads are tf.VariableSynchronization.ON_READ,this operation triggers on_read event,temp2_grad should be the agrregation value of all replicas in different devies.But in fact temp2_grad...

which CardDefs.xml do you use?

i run the code with default parameters,so it use the notbasicset files of yours. i can also finds some cards whose effects are not implemented.