AdaptiveAttention
AdaptiveAttention copied to clipboard
Question about Eq(8)
v represents the spatial image features, and I think it should be time-invariant, so why it has a subscript "t"?
Thanks.
Right... shouldn't it be just v_i?
I think it just keep its form the same as αti