
Chatbot Arena meets multi-modality! Multi-Modality Arena allows you to benchmark vision-language models side-by-side while providing images as inputs. Supports MiniGPT-4, LLaMA-Adapter V2, LLaVA, BLIP...

13 Multi-Modality-Arena issues

Hi, thanks for your efforts on this great work! I would like to ask whether you plan to open-source the Chatbot Arena conversation data. Thanks in advance! Best, Wei

Hello and thank you for your amazing work! However, I have a problem: the models are loaded well but I continue getting `NETWORK ERROR DUE TO HIGH TRAFFIC. PLEASE REGENERATE...

Wondering whether you used the Karpathy test split for Flickr30k, or a different test set, in your LVLM-eHUB paper. Thanks!

Hi all, could anyone provide the hardware requirements to run and test these models? I am planning to run them on local systems. It would be great if...
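While waiting for official numbers from the maintainers, a back-of-the-envelope VRAM estimate is a common starting point: weights at ~2 bytes per parameter in fp16, scaled by an overhead factor for activations and the KV cache. The sketch below is a heuristic only; the default `bytes_per_param` and `overhead` values are assumptions, not figures from this repository.

```python
def estimate_vram_gb(num_params_billion: float,
                     bytes_per_param: float = 2.0,
                     overhead: float = 1.2) -> float:
    """Rough inference VRAM estimate in GB.

    weights ~= params * bytes_per_param (2 bytes for fp16/bf16),
    multiplied by an overhead factor for activations / KV cache.
    A heuristic, not an exact requirement.
    """
    return num_params_billion * 1e9 * bytes_per_param * overhead / 1e9


# Example: a 7B-parameter model in fp16 needs roughly 17 GB by this estimate.
print(f"{estimate_vram_gb(7):.1f} GB")
```

Quantized weights (8-bit or 4-bit) reduce `bytes_per_param` accordingly, which is why several of these models can also run on smaller consumer GPUs.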

Thanks for releasing this benchmark. We tried to compute the categorical score for each ability but found low scores on several abilities, such as visual reasoning and visual perception. We...

Hello, thanks for the great work! I was looking at [this script](https://github.com/OpenGVLab/Multi-Modality-Arena/blob/main/peng_utils/test_llava.py) for llava evaluation on Flickr30k, but am facing some issues, detailed [here](https://github.com/haotian-liu/LLaVA/issues/768). Could you please help me with...

I ran the script on ScienceQA but it raises an error:

```
File "./Multi-Modality-Arena/LVLM_evaluation/task_datasets/vqa_datasets.py", line 140, in load_save_dataset
    self.image_list.append(sample['image'].convert('RGB'))
    ^^^^^^^^^^^^^^^^^^^^^^^
AttributeError: 'dict' object has no attribute 'convert'
```
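The traceback suggests `sample['image']` is a dict rather than a decoded `PIL.Image` (some dataset exports, e.g. Hugging Face parquet files, store images as a dict with `bytes`/`path` keys). A possible workaround is a small coercion helper; this is a sketch, and the `bytes`/`path` key names are an assumption about the serialization rather than something confirmed for this repo's ScienceQA copy.

```python
from io import BytesIO

from PIL import Image


def to_pil_rgb(sample_image):
    """Coerce a dataset image field to a PIL RGB image.

    Handles both an already-decoded PIL.Image and the encoded-dict form
    ({'bytes': ..., 'path': ...}) used by some dataset exports (assumed keys).
    """
    if isinstance(sample_image, Image.Image):
        return sample_image.convert('RGB')
    if isinstance(sample_image, dict):
        if sample_image.get('bytes'):
            return Image.open(BytesIO(sample_image['bytes'])).convert('RGB')
        if sample_image.get('path'):
            return Image.open(sample_image['path']).convert('RGB')
    raise TypeError(f"Unsupported image field type: {type(sample_image)!r}")
```

In `load_save_dataset`, replacing `sample['image'].convert('RGB')` with `to_pil_rgb(sample['image'])` would then accept either representation.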

First, I really appreciate your great contributions to the LVLM field. Do you have any plans to release the visual commonsense reasoning (VCR) evaluation code? There's some elaboration about how...