stable-baselines3

[Question] TD3 algorithm: during training, why limit the next_actions?

Open Danny551 opened this issue 1 year ago • 0 comments

❓ Question

In the TD3 algorithm, why are the next_actions limited during training? If my action range is much larger than [-1, 1], the data gets truncated. See https://github.com/DLR-RM/stable-baselines3/blob/master/stable_baselines3/td3/td3.py#L171
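
For context, here is a minimal sketch of one possible reading of that line; it is not confirmed anywhere in this thread. It assumes the library rescales environment actions to [-1, 1] before storing them in the replay buffer, so the clamp would act on normalized actions rather than on the raw environment range. The helper functions, the bounds `low`/`high`, and the values `noise_std=0.2` and `noise_clip=0.5` are illustrative assumptions written in plain Python, not the library's actual code.

```python
import random


def scale_action(action, low, high):
    """Map an environment action from [low, high] to [-1, 1] (illustrative helper)."""
    return 2.0 * (action - low) / (high - low) - 1.0


def unscale_action(scaled, low, high):
    """Map a normalized action from [-1, 1] back to [low, high] (illustrative helper)."""
    return low + 0.5 * (scaled + 1.0) * (high - low)


def clamp(x, lo, hi):
    """Restrict x to the closed interval [lo, hi]."""
    return max(lo, min(hi, x))


def smoothed_target_action(target_action, noise_std=0.2, noise_clip=0.5, rng=random):
    """Target policy smoothing on a normalized action.

    Adds clipped Gaussian noise, then clamps the result to [-1, 1].
    noise_std and noise_clip are assumed values for this sketch.
    """
    noise = clamp(rng.gauss(0.0, noise_std), -noise_clip, noise_clip)
    return clamp(target_action + noise, -1.0, 1.0)


# Hypothetical environment whose action range is much larger than [-1, 1].
low, high = -10.0, 10.0
env_action = 7.5

scaled = scale_action(env_action, low, high)        # normalized action
next_action = smoothed_target_action(scaled)        # stays inside [-1, 1]
recovered = unscale_action(next_action, low, high)  # back in [low, high]
```

If that assumption holds, the clamp only stops the smoothing noise from pushing a normalized action outside the valid range, and the full environment range is recovered when the action is unscaled. If actions are not normalized in a given setup, this reasoning would not apply.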

Checklist

[X] I have checked that there is no similar issue in the repo
[X] I have read the documentation
[X] If code there is, it is minimal and working
[X] If code there is, it is formatted using the markdown code blocks for both code and stack traces.

Oct 15 '24 03:10 Danny551
