tatk
tatk copied to clipboard
[Feature] [Policy] LARL
Describe the feature Add LARL in Policy module and support multiwoz dataset.
Expected behavior Train and test on multiwoz dataset. Report component performance and end2end performance.
Additional context Rethinking Action Spaces for Reinforcement Learning in End-to-end Dialog Agents with Latent Variable Models