HACAModel icon indicating copy to clipboard operation
HACAModel copied to clipboard

Implementation of "Watch, Listen, and Describe: Globally and Locally Aligned Cross-Modal Attentions for Video Captioning" (https://arxiv.org/abs/1804.05448)

HACAModel

Implementation of "Watch, Listen, and Describe: Globally and Locally Aligned Cross-Modal Attentions for Video Captioning" (https://arxiv.org/abs/1804.05448)

Requirements:

Use example has been provided in slurm.sh