DRL-code-pytorch
The SAC algorithm does not seem to converge
As the picture shows, the curve fluctuates around -120. I did not change anything in the code, so I am confused by the result.
Has this been fixed?