es_pytorch issues

Results 7 es_pytorch issues

Sort by recently updated

trafficstars

Reset to best performer when stuck

- When an individual becomes stuck, reset params to the best performer seen yet

sash-a

enhancement

Recombination as an exploration mechanism

- Try both types of recombination as a means to _reset_ the params and explore a different area

sash-a

enhancement

Noise std strategy

- Self adaptive theta as seen in Evolutionary strategies a comprehensive introduction - Use multiple different theta values for each noise ind. This allows one to search along the trajectory...

sash-a

enhancement

Investigate novelty search

From testing novelty search seems to be doing as it is programmed, but in practice it performs poorly. Options: - [x] compare to openai's method of calculating the novelty metric...

sash-a

bug

Pull toward best ever performer

Move params in direction of best ever reward. Needs to be done with population

sash-a

enhancement

Add in ref batch for virtual batch norm

sash-a

enhancement

Add noise to gradient update to encourage exploration

* Use a temperature param that decreases over time to control the size of the noise - similar to epsilon greedy

sash-a

enhancement

es_pytorch
es_pytorch copied to clipboard

Metadata

Reset to best performer when stuck

Recombination as an exploration mechanism

Noise std strategy

Investigate novelty search

Pull toward best ever performer

Add in ref batch for virtual batch norm

Add noise to gradient update to encourage exploration

← Metadata

Owner

Metadata

es_pytorch es_pytorch copied to clipboard

Metadata

Reset to best performer when stuck

Recombination as an exploration mechanism

Noise std strategy

Investigate novelty search

Pull toward best ever performer

Add in ref batch for virtual batch norm

Add noise to gradient update to encourage exploration

← Metadata

Owner

Metadata

es_pytorch
es_pytorch copied to clipboard