stable-baselines3
stable-baselines3 copied to clipboard
[Question] TD3 algorithm, During training,why limit the next_actions
❓ Question
TD3 algorithm, During training,why limit the next_actions? If my action range is much larger than [-1,1], the data is truncated https://github.com/DLR-RM/stable-baselines3/blob/master/stable_baselines3/td3/td3.py#L171
Checklist
- [X] I have checked that there is no similar issue in the repo
- [X] I have read the documentation
- [X] If code there is, it is minimal and working
- [X] If code there is, it is formatted using the markdown code blocks for both code and stack traces.