Deep-Reinforcement-Learning-Algorithms-with-PyTorch Question on SAC implementation

Question on SAC implementation

Open fokx opened this issue 4 years ago • 0 comments

trafficstars

In SAC.py Line 120 https://github.com/p-christ/Deep-Reinforcement-Learning-Algorithms-with-PyTorch/blob/b338c87bebb672e39304e47e0eed55aeb462b243/agents/actor_critic_agents/SAC.py#L120 However, the output of produce_action_and_action_info(state) is https://github.com/p-christ/Deep-Reinforcement-Learning-Algorithms-with-PyTorch/blob/b338c87bebb672e39304e47e0eed55aeb462b243/agents/actor_critic_agents/SAC.py#L135 So, even though SAC algorithm can work in practice, is it a mistake?

Nov 15 '21 04:11 fokx

Deep-Reinforcement-Learning-Algorithms-with-PyTorch Deep-Reinforcement-Learning-Algorithms-with-PyTorch copied to clipboard

Question on SAC implementation

Deep-Reinforcement-Learning-Algorithms-with-PyTorch
Deep-Reinforcement-Learning-Algorithms-with-PyTorch copied to clipboard