es_pytorch icon indicating copy to clipboard operation
es_pytorch copied to clipboard

Add noise to gradient update to encourage exploration

Open sash-a opened this issue 5 years ago • 0 comments
trafficstars

  • Use a temperature param that decreases over time to control the size of the noise - similar to epsilon greedy

sash-a avatar Sep 28 '20 13:09 sash-a