stog
Are the source-side and target-side vocabularies shared?
If not, how are the source-side and target-side attention probability distributions added together?
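To make the question concrete, here is a minimal sketch of one common way to combine such distributions when the vocabularies are not shared: project everything onto an extended vocabulary keyed by token string, then mix with switch probabilities. This is an assumption for illustration only and not necessarily what stog does; `combine_distributions`, `switch`, and the toy tokens are all hypothetical names and data.

```python
def combine_distributions(p_vocab, vocab, src_attn, src_tokens,
                          tgt_attn, tgt_tokens, switch):
    """Mix a generation distribution and two copy (attention) distributions.

    Hypothetical helper, not taken from stog.
    switch = (p_gen, p_src, p_tgt) is assumed to sum to 1.
    """
    p_gen, p_src, p_tgt = switch

    # Extended vocabulary: the fixed vocabulary plus every copyable token
    # that is missing from it, so all three distributions share one index space.
    extended = list(vocab)
    index = {tok: i for i, tok in enumerate(extended)}
    for tok in list(src_tokens) + list(tgt_tokens):
        if tok not in index:
            index[tok] = len(extended)
            extended.append(tok)

    final = [0.0] * len(extended)
    # Generation probabilities keep their original vocabulary indices.
    for i, p in enumerate(p_vocab):
        final[i] += p_gen * p
    # Attention mass at each position is added to the token at that position.
    for tok, a in zip(src_tokens, src_attn):
        final[index[tok]] += p_src * a
    for tok, a in zip(tgt_tokens, tgt_attn):
        final[index[tok]] += p_tgt * a
    return extended, final


# Toy example (made-up numbers).
extended, final = combine_distributions(
    p_vocab=[0.5, 0.5], vocab=["the", "boy"],
    src_attn=[0.25, 0.75], src_tokens=["boy", "Obama"],
    tgt_attn=[0.5, 0.5], tgt_tokens=["person", "boy"],
    switch=(0.5, 0.25, 0.25),
)
for tok, p in zip(extended, final):
    print(tok, p)
```

In this sketch, "adding" is only meaningful after the two attention distributions are mapped to the same token index space; is that what happens here, or are the distributions kept separate?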