Reinforcement-learning-with-tensorflow

In actor-critic, the critic predicts a value. Can it be designed to predict the action-value distribution instead?

Open Hins opened this issue 5 years ago • 0 comments

Then take the value of the corresponding action to compute v and v'.
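To make the idea concrete, here is a minimal sketch of what I mean, not code from the repository. It assumes that "action value distribution" means one value per discrete action, Q(s, ·), and that v and v' are read off at the action taken and the next action (a SARSA-style target). The class `LinearQCritic`, the linear model, and `GAMMA = 0.9` are all hypothetical and chosen only for illustration.

```python
import random

GAMMA = 0.9  # discount factor (assumed value, for illustration only)


class LinearQCritic:
    """Hypothetical critic with one linear output per action: Q(s, .) = W s."""

    def __init__(self, n_features, n_actions, lr=0.01, seed=0):
        rng = random.Random(seed)
        # weights[a][i] is the weight of feature i for action a
        self.weights = [
            [rng.uniform(-0.1, 0.1) for _ in range(n_features)]
            for _ in range(n_actions)
        ]
        self.lr = lr

    def q_values(self, state):
        """Return the list of action values for `state`, one per action."""
        return [sum(w * x for w, x in zip(row, state)) for row in self.weights]

    def learn(self, s, a, r, s_next, a_next, done=False):
        """One SARSA-style TD step; returns the TD error."""
        v = self.q_values(s)[a]  # value of the action actually taken
        # value of the next action; zero if the episode has ended
        v_next = 0.0 if done else self.q_values(s_next)[a_next]
        td_error = r + GAMMA * v_next - v
        # Semi-gradient update: only the row of the taken action changes
        self.weights[a] = [
            w + self.lr * td_error * x for w, x in zip(self.weights[a], s)
        ]
        return td_error


critic = LinearQCritic(n_features=4, n_actions=2)
td = critic.learn([0.1, 0.0, -0.2, 0.3], 0, 1.0, [0.2, 0.1, -0.1, 0.0], 1)
```

Note that the returned TD error is then based on Q(s, a) rather than on a state value V(s), so feeding it to the actor in place of the usual TD error changes what the actor is trained on. Whether that is a sound design is exactly what I am asking.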

Hins, Jul 09 '20 06:07