LAVT-RIS
LAVT-RIS copied to clipboard
a mistake in class SpatialImageLanguageAttention
sim_map = sim_map + (1e4*l_mask - 1e4) # assign a very small number to padding positions I guess the 'small number' should be 1e-4?