attention-module
look at the ablation study part in the paper
In the paper's ablation study, sum gets better performance than prod, so maybe you can change prod to sum.
Originally posted by @splinter22 in https://github.com/Jongchan/attention-module/issues/1#issuecomment-449644106
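
For anyone who wants to try the change, here is a rough sketch of the two combination modes being compared. It is not the repository's code: it uses NumPy instead of the actual model, and the function name `combine_attention`, the tensor shapes, the point where the sigmoid is applied, and the residual `1 +` term are all assumptions made for illustration.

```python
import numpy as np


def sigmoid(x):
    """Element-wise logistic function."""
    return 1.0 / (1.0 + np.exp(-x))


def combine_attention(channel_att, spatial_att, mode="sum"):
    """Merge a channel map (C, 1, 1) and a spatial map (1, H, W) into (C, H, W).

    Hypothetical helper: `mode` selects how the two branches are combined
    before the sigmoid, either element-wise sum or element-wise product.
    """
    if mode == "sum":
        logits = channel_att + spatial_att  # broadcasts to (C, H, W)
    elif mode == "prod":
        logits = channel_att * spatial_att  # broadcasts to (C, H, W)
    else:
        raise ValueError(f"unknown mode: {mode!r}")
    return sigmoid(logits)


# Toy inputs standing in for the outputs of the two attention branches.
rng = np.random.default_rng(0)
features = rng.standard_normal((4, 3, 3))     # (C, H, W) feature map
channel_att = rng.standard_normal((4, 1, 1))  # one value per channel
spatial_att = rng.standard_normal((1, 3, 3))  # one value per location

# Assumed residual-style refinement: features are scaled by 1 + attention.
refined_sum = features * (1.0 + combine_attention(channel_att, spatial_att, "sum"))
refined_prod = features * (1.0 + combine_attention(channel_att, spatial_att, "prod"))
print(refined_sum.shape, refined_prod.shape)
```

Switching between the two is a one-line change in how the branch outputs are merged, so it should be cheap to check both variants against the numbers reported in the ablation study.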