Audio-Visual-Video-Caption
Audio-Visual-Video-Caption copied to clipboard
MultiLevel Attention
Hi, Why are the multilevel attentions being used during encoding? They are used only during decoding according to the paper about Multimodal attention..