LLaVA
How can we get the attention weights of the input tokens during inference? [Discussion]
Discussion
I've been experimenting with LLaVA and want to get the attention weights of the input tokens (both text and image) during inference. Since it uses LlamaSdpaAttention plus the monkey patch, I wanted to know how we can extract those weights. (I'm new to PyTorch and still learning, so pardon me if this is a dumb question.)
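For context on why the SDPA path is the obstacle here: PyTorch's fused `scaled_dot_product_attention` kernel never materializes the softmax weight matrix, so an SDPA-based attention class has nothing to return. A minimal sketch (tensor shapes chosen arbitrarily for illustration) showing that an eager implementation computes the same output while also exposing the per-head weights:

```python
import torch
import torch.nn.functional as F

def eager_attention(q, k, v):
    # Eager attention materializes the full weight matrix, so we can return it.
    scale = q.shape[-1] ** -0.5
    weights = F.softmax((q @ k.transpose(-2, -1)) * scale, dim=-1)
    return weights @ v, weights

# (batch, num_heads, seq_len, head_dim) -- arbitrary small sizes for the demo
q = torch.randn(1, 4, 8, 16)
k = torch.randn(1, 4, 8, 16)
v = torch.randn(1, 4, 8, 16)

out_eager, attn_weights = eager_attention(q, k, v)
# The fused SDPA kernel gives the same output but exposes no weights.
out_sdpa = F.scaled_dot_product_attention(q, k, v)

assert torch.allclose(out_eager, out_sdpa, atol=1e-5)
print(attn_weights.shape)  # one (seq_len, seq_len) weight map per head
```

In Hugging Face `transformers`, the usual way around this is to load the model with `attn_implementation="eager"` and pass `output_attentions=True` to the forward call; `outputs.attentions` is then a per-layer tuple of `(batch, num_heads, seq_len, seq_len)` tensors, where the sequence dimension covers the image patch tokens as well as the text tokens. With a custom monkey patch you would need to make sure the patched attention returns the weight tensor the same way.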