Deep-Reinforcement-Learning-Algorithms-with-Pytorch icon indicating copy to clipboard operation
Deep-Reinforcement-Learning-Algorithms-with-Pytorch copied to clipboard

关于SAC_Continuous的Actor部分的问题

Open lgmtxl opened this issue 1 year ago • 0 comments

在您这行代码中: https://github.com/XinJingHao/DRL-Pytorch/blob/aa17b796ba632f371863eeaa62406e183177dbae/5.2%20SAC-Continuous/utils.py#L47

为什么对于第一项的dist.log_prob(u),在dim=1的维度上进行sum操作?按照原来的公式,似乎这一项并不是sum操作吧? 请您指正!

lgmtxl avatar Aug 29 '24 10:08 lgmtxl