Deep-Reinforcement-Learning-Algorithms-with-Pytorch issues

Results: 3 Deep-Reinforcement-Learning-Algorithms-with-Pytorch issues


Issue with 2.1

Hello, this is an issue about file 2.1. Because I ran the program on the CPU rather than CUDA, I changed the default dvc in main to the CPU, and...

sunshine339

Always predict boundary action in TD3

Hi, I'm having problems with action prediction with TD3. The agent tends to predict boundary actions after it starts learning. Do you know what is causing this problem? ![image](https://github.com/user-attachments/assets/59fbc73b-0ecf-4f0e-bdda-61d98b7c9430)

Leong1230

Question about the Actor part of SAC_Continuous

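In this line of your code: https://github.com/XinJingHao/DRL-Pytorch/blob/aa17b796ba632f371863eeaa62406e183177dbae/5.2%20SAC-Continuous/utils.py#L47 why is the first term, dist.log_prob(u), summed over dim=1? According to the original formula, this term does not seem to involve a sum. Please correct me if I am wrong!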

lgmtxl
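
For the SAC_Continuous question above, one possible reading is that dim=1 indexes the action dimensions and that the policy is a diagonal Gaussian, so each action dimension is sampled independently. Under that assumption (the linked code is not reproduced here, so this is not confirmed), the joint density is the product of the per-dimension densities, and the joint log-probability is therefore their sum. The sketch below checks this identity in plain Python; the helper names and the sample values are hypothetical and chosen only for illustration.

```python
import math


def normal_log_prob(x, mean, std):
    """Log-density of a 1-D Gaussian N(mean, std^2) evaluated at x."""
    var = std * std
    return (-((x - mean) ** 2) / (2.0 * var)
            - math.log(std)
            - 0.5 * math.log(2.0 * math.pi))


def joint_log_prob(u, means, stds):
    """Joint log-density of a diagonal Gaussian.

    Assumption: action dimensions are independent, so the joint
    log-density is the sum of the per-dimension log-densities.
    """
    return sum(normal_log_prob(x, m, s) for x, m, s in zip(u, means, stds))


def joint_density_by_product(u, means, stds):
    """Joint density computed directly as a product of 1-D densities."""
    p = 1.0
    for x, m, s in zip(u, means, stds):
        p *= math.exp(normal_log_prob(x, m, s))
    return p


# Hypothetical 2-D action sample and policy parameters.
u = [0.3, -1.2]
means = [0.0, -1.0]
stds = [1.0, 0.5]

log_p = joint_log_prob(u, means, stds)
# exp(sum of logs) matches the product of the per-dimension densities.
assert math.isclose(math.exp(log_p), joint_density_by_product(u, means, stds))
```

If this reading applies, a formula written with a single log-probability of the whole action vector and code that sums per-dimension log-probabilities would describe the same quantity.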
