
20 comments of Rousslan F.J. Dossa

@timoklein Here is a snippet that would address @araffin's comments: https://github.com/vwxyzjn/cleanrl/pull/270#discussion_r1031332675 > why do you keep dim here as you flatten it in the next line? https://github.com/vwxyzjn/cleanrl/pull/270#discussion_r1031337608 > it's a...

Indeed. Unlike continuous SAC, the output of the policy is not fed through the Q functions for the actor loss, so joint loss optimization will not be required.
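The point above can be sketched numerically. This is a minimal NumPy illustration (not CleanRL's actual PyTorch code): with a discrete action space the policy outputs probabilities over all actions, so the actor loss is a closed-form expectation over the Q-values and no sampled action needs to be passed back through the Q networks. The function names and shapes here are assumptions for illustration.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax over the action dimension.
    z = x - x.max(axis=axis, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)

def discrete_actor_loss(logits, q1, q2, alpha):
    """Sketch of a SAC-discrete actor loss.

    logits, q1, q2: arrays of shape (batch, n_actions).
    The expectation E_{a~pi}[alpha * log pi(a|s) - Q(s, a)] is computed
    analytically over all actions instead of via a sampled action.
    """
    probs = softmax(logits)
    log_probs = np.log(probs + 1e-8)
    min_q = np.minimum(q1, q2)  # clipped double-Q
    return (probs * (alpha * log_probs - min_q)).sum(axis=-1).mean()
```

Because the expectation is exact, no reparameterization trick (and hence no joint actor/critic graph through the Q networks) is needed in the discrete case.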

> eps=1e-4 for Adam is required. Without this, there are seeds where SAC-d doesn't learn at all on Pong. This setting is also used in the [author's codebase](https://github.com/p-christ/Deep-Reinforcement-Learning-Algorithms-with-PyTorch/blob/master/agents/actor_critic_agents/SAC.py). Has cost...
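A plausible intuition for why `eps` matters, sketched numerically (this is an illustration of Adam's update rule in general, not a claim about the linked codebase): when the second-moment estimate `v` is tiny, the denominator `sqrt(v) + eps` is dominated by `eps`, so a larger `eps` bounds the effective step size instead of letting noisy gradients produce near-maximal updates.

```python
import numpy as np

def adam_step(grad_avg, v, lr=3e-4, eps=1e-8):
    # Effective Adam update magnitude (bias correction omitted for brevity):
    # step = lr * m_hat / (sqrt(v_hat) + eps)
    return lr * grad_avg / (np.sqrt(v) + eps)

# With a tiny second-moment estimate, the default eps barely damps the step,
# while eps=1e-4 shrinks it by roughly two orders of magnitude.
noisy = adam_step(1e-6, v=1e-12, eps=1e-8)
damped = adam_step(1e-6, v=1e-12, eps=1e-4)
```

The absolute numbers here are arbitrary; the point is only the relative damping a larger `eps` provides in low-gradient-variance regimes.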

> It's a bit hand-wavy, but my best explanation is that not using up-to-date values will result in steps that are slightly off. Over time, these errors accumulate and throw...

@Chulabhaya The original SAC implementation was developed for the continuous-action case, while this one is for discrete actions, hence the difference in computing the target entropy. Furthermore, in both cases the target entropy...
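The two heuristics can be written side by side. This is a sketch of the commonly used conventions (continuous SAC targets the negative action dimensionality; SAC-discrete targets a fraction of the maximum entropy of a uniform distribution over the actions); the `scale` value is the one reported in the SAC-discrete paper's setup, labeled here as an assumption rather than CleanRL's exact code.

```python
import numpy as np

def target_entropy_continuous(action_dim):
    # Continuous SAC heuristic: -|A|, the action dimensionality.
    return -float(action_dim)

def target_entropy_discrete(n_actions, scale=0.98):
    # SAC-discrete heuristic: a fraction of the entropy of the uniform
    # distribution over n_actions, i.e. scale * log(n_actions).
    return -scale * np.log(1.0 / n_actions)
```

For a 6-dimensional continuous action space this gives -6.0, while for 4 discrete actions it gives roughly 0.98 * log(4) ≈ 1.36, which is why the two codepaths differ.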

@timoklein Thanks a lot for the detailed experiments. Indeed, running experiments with more than one seed is critical. Sometimes a seed can lead to "degenerate" results that are not really...

Greetings. Sorry for the late answer. In the original implementation, the `mean` is used for deterministic evaluation of the agent. Intuitively, using `mean` corresponds to the greediest policy, and would...
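The distinction above can be sketched as follows. This is a minimal NumPy illustration of the usual squashed-Gaussian SAC policy (an assumption about the setup, not the original implementation's code): at training time an action is sampled from the tanh-squashed Gaussian, while at evaluation time `tanh(mean)` is taken deterministically, which is the greediest choice the policy can make.

```python
import numpy as np

def stochastic_action(mean, log_std, rng):
    # Training-time action: sample from the tanh-squashed Gaussian.
    std = np.exp(log_std)
    return np.tanh(mean + std * rng.standard_normal(mean.shape))

def deterministic_action(mean):
    # Evaluation-time action: tanh of the Gaussian mean, i.e. the
    # "greediest" (mode-seeking) action under the squashed policy.
    return np.tanh(mean)
```

Both outputs are bounded in (-1, 1) by the tanh squashing, so they can be rescaled to the environment's action range in the same way.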

Hello there. Not sure if you are still looking for a fix, or if someone else has the same problem: the window popped up after trying to run `python oculus_reader/reader.py`

Hello again. In case someone is looking, a workaround would be to execute `git config --global ...` instead. Might not be as secure, but should get it working as it...

Hello there. Thanks a lot for the interest! It's been a while, but as far as I remember, the Torcs simulator binary that is used along with this wrapper...