Multi-Modality-Arena icon indicating copy to clipboard operation
Multi-Modality-Arena copied to clipboard

Chatbot Arena meets multi-modality! Multi-Modality Arena allows you to benchmark vision-language models side-by-side while providing images as inputs. Supports MiniGPT-4, LLaMA-Adapter V2, LLaVA, BLIP...

Results 13 Multi-Modality-Arena issues
Sort by recently updated
recently updated
newest added

Hi, I'm the author of the paper [Aligning Large Multi-Modal Model with Robust Instruction Tuning](https://github.com/FuxiaoLiu/LRV-Instruction) and want to add our model to your amazing arena. May I know you email...

Nice work! Interested in the design of 1 vs 1 battles between LVLMs, but can you share more details about the Elo rating algorithm? Like the choice of k-factor, the...

Hi! I'm a fan of your work. Can you please provide more details about how to do eval for MiniGPT-4 and LLaVA on various datasets? Thanks a lot!