Show-Attend-and-Tell
Show-Attend-and-Tell copied to clipboard
Caption generation fails when using networks other than VGG19
RuntimeError: shape '[14, 14]' is invalid for input of size 49
is the error thrown. Naively reshaping the alphas to fit the input size causes weird cropping issues in the attention highlights.