VILA About perception testset

About perception testset

Open mary-0830 opened this issue 1 year ago • 3 comments

Hello authors, Thanks for sharing fantastic jobs. Now I would like to ask where this dataset came from, can you share a link or data? "/lustre/fsw/portfolios/nvr/projects/nvr_elm_llm/dataset/video_datasets_v2/perception_test/"

May 08 '24 06:05 mary-0830

https://github.com/google-deepmind/perception_test

Specifically https://storage.googleapis.com/dm-perception-test/zip_data/valid_videos.zip

May 08 '24 07:05 Efficient-Large-Language-Model

https://github.com/google-deepmind/perception_test

Specifically https://storage.googleapis.com/dm-perception-test/zip_data/valid_videos.zip

Thank you for your quick reply! I have two questions I would like to ask.

Does the perception test not require gpt assistance for evaluation?
Why is the input in the model_vqa_videoperception.py different from other vqa inference evaluations? def get_model_option returns loss?

May 08 '24 08:05 mary-0830

yeah, it does not require gpt assistance
we followed the official repo to implemement the evaluation, you can refer to the offical repo

May 08 '24 14:05 Efficient-Large-Language-Model

VILA VILA copied to clipboard

About perception testset

VILA
VILA copied to clipboard