evolution-strategies-starter issues

Doesn't the gradient need to be rescaled with σ ?

4

According to the paper on page 3, Algorithm 2, the gradient in line 11 is rescaled by the standard deviation. However I can't see it in the code in: https://github.com/openai/evolution-strategies-starter/blob/master/es_distributed/es.py#L247...

pzdkn

How to decide values for noise sd and number of samples per gradient estimation?

Obviously very problem dependent as usual, but for the noise of the perturbations this is set to the 0.02 standard dev in the config (also in the paper apparently we...

ben-arnao

why batched_weighted_sum?

2

Hello, why use batched_weighted_sum instead of direct dot on all items?

joyousrabbit

Virtual batch Normalization

3

If I understand the code correctly, it uses virtual batch normalization only for the inputs and **not** for the intermediate layers. Was this done in the Atari context for getting...

sahiliitm

fix bug for tf.concat api

tf.concat(concat_dim, values, ) for tensorflow 1.0+, the api tf.concat changed

harpsword

Why design ac_noise?

1

hi, Thanks for code sharing! after reading the source code, l have such questions, could you help me better understand the code? 1. why design ac_noise? rather than deterministic action...

fiberleif

Comment

I've been posting some ideas about evolution around the internet, so why not here! “ Quantization is the enemy of evolution It is fortunate that biological systems are heavily quantized,...

ghost

non-EC2 version?

2

Thank you very much for this great job! I was wondering if there exists a local version of this implementation that doesn't need EC2 to run. It would make it...

benmodels

Config for non-mujoco users

1

Related to #5 but to say that any config that doesn't assume a mujoco license would be great (including AMI-packing step if that is relevant).

danbri

Intuitions on implementing ES on text data for text classification use-cases

Hi, I would like to implement ES on text data can you please provide some intuitions on implementing ES for text classification use-cases. Possibly should I injects noise directly in...

goodrahstar

evolution-strategies-starter
evolution-strategies-starter copied to clipboard

Metadata

Doesn't the gradient need to be rescaled with σ ?

How to decide values for noise sd and number of samples per gradient estimation?

why batched_weighted_sum?

Virtual batch Normalization

fix bug for tf.concat api

Why design ac_noise?

Comment

non-EC2 version?

Config for non-mujoco users

Intuitions on implementing ES on text data for text classification use-cases

← Metadata

Owner

Metadata

evolution-strategies-starter evolution-strategies-starter copied to clipboard

Metadata

← Metadata

Owner

Metadata

evolution-strategies-starter
evolution-strategies-starter copied to clipboard