Reinforcement-learning-with-tensorflow
Reinforcement-learning-with-tensorflow copied to clipboard

Published 20 hours ago •

Reame
Issues

请问actor-critic中的critic预测价值，可以设计为预测action value分布吗？

Open Hins opened this issue 5 years ago • 0 comments

然后取相应action的value计算v和v'

Jul 09 '20 06:07 Hins