ShiyangYan

Results 2 issues of ShiyangYan

Could you provide the evaluation code accordingly? Thank you very much.

In the original paper, they used rollout to get the intermediate rewards during sequence generation, in this codes, it seems the generator only gets rewards when the whole sequence is...