KYY

Results: 1 issue by KYY

https://github.com/seungeunrho/minimalRL/blob/master/dqn.py https://github.com/seungeunrho/minimalRL/blob/7597b9af94ee64536dfd261446d795854f34171b/dqn.py#L63 Why does the `train` method loop 10 times internally? Shouldn't the policy network be trained once per action instead?
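To make the question concrete, here is a minimal sketch that only counts gradient updates under the two schedules being compared. It assumes, without confirmation from the linked file, that `train` is called once per episode and loops 10 times per call; the function names and episode lengths are hypothetical and chosen purely for illustration.

```python
def updates_with_inner_loop(episode_lengths, updates_per_call=10):
    """Count updates if train() is called once per episode (an assumption)
    and performs `updates_per_call` gradient updates per call."""
    return len(episode_lengths) * updates_per_call


def updates_per_action(episode_lengths):
    """Count updates if exactly one gradient update follows every action."""
    return sum(episode_lengths)


# Hypothetical episode lengths, in environment steps per episode.
episode_lengths = [20, 50, 200]

inner_loop_total = updates_with_inner_loop(episode_lengths)
per_action_total = updates_per_action(episode_lengths)
print(inner_loop_total, per_action_total)
```

Under these assumptions the inner loop fixes the number of updates per episode regardless of episode length, whereas a per-action schedule scales the number of updates with the number of steps taken.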