es_pytorch
Add noise to gradient update to encourage exploration
- Use a temperature parameter that decays over time to control the magnitude of the noise, similar to how epsilon is annealed in epsilon-greedy exploration
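
One way the idea could look, as a minimal sketch rather than the repository's implementation: Gaussian noise is added to the gradient before the update, and its scale is a temperature that decays exponentially toward a floor. The names `temperature` and `noisy_update`, the exponential schedule, the default values, and the sign of the update (ascent) are all assumptions made for illustration.

```python
import random


def temperature(step, t_start=1.0, t_end=0.01, decay=0.99):
    """Hypothetical schedule: decay exponentially from t_start, never below t_end."""
    return max(t_end, t_start * decay ** step)


def noisy_update(params, grad, step, lr=0.1, rng=random):
    """Take one update step with temperature-scaled Gaussian noise added to the gradient.

    Assumes gradient ascent (params move along +grad); flip the sign for descent.
    """
    temp = temperature(step)
    return [p + lr * (g + temp * rng.gauss(0.0, 1.0)) for p, g in zip(params, grad)]


if __name__ == "__main__":
    params = [0.0, 1.0]
    grad = [0.5, -0.5]  # placeholder gradient estimate
    for step in (0, 100, 1000):
        print(step, temperature(step), noisy_update(params, grad, step))
```

Early steps get noise on the order of `t_start`, which pushes the parameters off the estimated gradient direction; later steps are dominated by the gradient itself, much as epsilon-greedy shifts from random to greedy actions as epsilon shrinks.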