tatk [Feature] [Policy] LARL

[Feature] [Policy] LARL

Open zqwerty opened this issue 5 years ago • 0 comments

Describe the feature Add LARL in Policy module and support multiwoz dataset.

Expected behavior Train and test on multiwoz dataset. Report component performance and end2end performance.

Additional context Rethinking Action Spaces for Reinforcement Learning in End-to-end Dialog Agents with Latent Variable Models

Nov 13 '19 13:11 zqwerty