RLSeq2Seq
RLSeq2Seq copied to clipboard
Deep Reinforcement Learning For Sequence to Sequence Models
First of all, thanks for sharing the wonderful work. You did a great job. What I need to do is to apply the trained model on an article. I used...
https://github.com/yaserkl/RLSeq2Seq/blob/515a4cb709d7c7ac7af152b48b03b526a94470dd/src/model.py#L372-L381 I'm sorry if I understand the paper or the code in a wrong way, but according to my current understanding, the conditional probability in Equation 15 is the prob...
Hello! I can't understand this (389 - 407 line in run_summarization.py), why the "dqn_best_action" use state other than state_prime ? I think dist_q_val = -tf.log(dist) * q_value (model.py) which means...
Bumps [tensorflow-gpu](https://github.com/tensorflow/tensorflow) from 1.10 to 2.12.0. Release notes Sourced from tensorflow-gpu's releases. TensorFlow 2.12.0 Release 2.12.0 TensorFlow Breaking Changes Build, Compilation and Packaging Removed redundant packages tensorflow-gpu and tf-nightly-gpu. These...