Chanwoong joo comments


README asset

![image](https://user-images.githubusercontent.com/23333028/48773457-c37cec00-ed0a-11e8-8c20-f9c35effc42d.png)

README asset

![image](https://user-images.githubusercontent.com/23333028/50719328-de9a9400-10dd-11e9-8c8c-29f7709cdf1d.png)

README asset

@kslazarev Hi, I used that config, except that NumEnv is 128 and MaxStepPerEpisode is 4500. The paper does not mention Advantage Norm or NoisyNet, so I disabled those options.

@kslazarev per_rollout and per_epi are not on the same scale. per_rollout counts one global update (one call to agent.train_model()), whereas per_epi reports one episode from a single one of the parallel envs. If one...
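To illustrate why the two counters advance at different rates, here is a minimal toy sketch. It is not the repository's training loop: the function name, `num_step`, `done_prob`, and the random "episode ended" check are all hypothetical stand-ins, and only the idea (one update per rollout, one episode event per env) comes from the comment above.

```python
import random


def simulate_counters(num_env=128, num_step=128, num_rollouts=10,
                      done_prob=0.01, seed=0):
    """Toy loop: count global updates (per_rollout) and finished episodes (per_epi).

    All parameters are illustrative assumptions, not values from the repository.
    """
    rng = random.Random(seed)
    rollout_count = 0  # advances once per global update
    episode_count = 0  # advances whenever any single env ends an episode
    for _ in range(num_rollouts):
        for _ in range(num_step):
            for _ in range(num_env):
                # Stand-in for stepping one env: each env ends its
                # episode independently of the others.
                if rng.random() < done_prob:
                    episode_count += 1
        # Stand-in for agent.train_model(): one update per collected rollout.
        rollout_count += 1
    return rollout_count, episode_count


rollouts, episodes = simulate_counters()
print("per_rollout steps:", rollouts)
print("per_epi steps:", episodes)
```

Because the episode counter is driven by every parallel env while the rollout counter is driven only by updates, the two x-axes cannot be compared directly.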

README asset

@kslazarev Please create a separate issue for each question. :)

global_grad_norm_ has no effect

@shuang-liu Could you briefly explain why this doesn't work?

How long did you get 6100?

My model reached 6100 after running 16k rollouts. :)

How long did you get 6100?

How many envs did you use? I used 128 envs.

How long did you get 6100?

The paper also mentions that performance was not very good. Why not train for longer? Reinforcement learning is always a battle of patience. :)