LLaVA-HR
Understanding how MR-Adapter works
Great work! May I know the intuitive reasons why the MR-Adapter is designed this way?
- Why do we need a Conv block for the low-resolution features but an MLP for the high-resolution features?
- What's the reason behind having the gate g in [-1, 1] before aggregating the high-resolution features?
- What's the reason for adding the original features Fvl in equation (3)? Is it to help gradient flow?
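For reference, here is how I currently read the aggregation in equation (3), as a toy sketch. All names are mine, and the simple scaling functions only stand in for the actual Conv block and MLP, so please correct me if this misrepresents the design:

```python
import math
import random

# Illustrative assumptions, not the repo's code: a single feature vector
# of toy dimension d for each branch.
random.seed(0)
d = 4
F_vl = [random.gauss(0, 1) for _ in range(d)]  # low-resolution features
F_vh = [random.gauss(0, 1) for _ in range(d)]  # high-resolution features

def toy_mlp(x):
    # Stand-in for the MLP applied to the high-resolution branch.
    return [0.5 * v for v in x]

def toy_conv(x):
    # Stand-in for the Conv block applied to the low-resolution branch.
    return [0.2 * v for v in x]

alpha = 0.3            # learnable scalar; tanh bounds the gate to (-1, 1)
g = math.tanh(alpha)

# Equation (3) as I read it: residual F_vl, plus the conv branch,
# plus the gated high-resolution MLP branch.
out = [fl + cl + g * mh
       for fl, cl, mh in zip(F_vl, toy_conv(F_vl), toy_mlp(F_vh))]

assert len(out) == d
assert -1.0 < g < 1.0
```

My guess is that the residual F_vl term lets the adapter start close to identity (g near 0 early in training), but I'd appreciate confirmation.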