painterner
Results
1
issues of
painterner
Hello! I can't understand this (389 - 407 line in run_summarization.py), why the "dqn_best_action" use state other than state_prime ? I think dist_q_val = -tf.log(dist) * q_value (model.py) which means...