LLaVA-HR icon indicating copy to clipboard operation
LLaVA-HR copied to clipboard

Understanding how MR-Adapter works

Open SoroushMehraban opened this issue 11 months ago • 0 comments

Great work! May I know the intuitive reasons why the MR-Adapter is designed this way?

  • Why do we need Conv block for low resolution but MLP for high resolution?
  • What's the reason behind having this gate g in [-1, 1] before aggregation of high resolution features?
  • What's the reason of adding the original features Fvl in equation (3)? Is it to help the gradient flow?

SoroushMehraban avatar Mar 12 '24 03:03 SoroushMehraban