DeCo
DeCo copied to clipboard
> Which part of the code do you need most urgently: DeCo training, inference, ckpts or the interpretability tool R_GAE? _Originally posted by @yaolinli in [#7](https://github.com/yaolinli/DeCo/issues/7#issuecomment-2608834930)_ I need the code...
你好请问什么时候开源呢?好久好久了
@yaolinli Q-Former needs to be retrained after changing to 2D average pooling, right?
LLaVA can't support visual grounding, how can you perform inference on RefCOCO?
May I ask when will it be open sourced?
A very impressive work for MLLM interpretability. I want to know how to compute the query-to-patch attention map (the top lines of Fig. 3) for linear projection (e.g., LLaVA), since...
Thank you for your outstanding work. I am very interested in your research. Could you please let me know when you expect to release the code for the model? I...
R-GAE
Hi, Thanks to your Solid work!I want to know how to calculate the R-GAE maps,especially the Query-to-patch. Could you please supply some key codes.
Thanks for your great work! I wanna know how you compute the raw token lens, just like the 729 in the image.