VILA icon indicating copy to clipboard operation
VILA copied to clipboard

About perception testset

Open mary-0830 opened this issue 1 year ago • 3 comments

Hello authors, Thanks for sharing fantastic jobs. Now I would like to ask where this dataset came from, can you share a link or data? "/lustre/fsw/portfolios/nvr/projects/nvr_elm_llm/dataset/video_datasets_v2/perception_test/"

mary-0830 avatar May 08 '24 06:05 mary-0830

https://github.com/google-deepmind/perception_test

Specifically https://storage.googleapis.com/dm-perception-test/zip_data/valid_videos.zip

https://github.com/google-deepmind/perception_test

Specifically https://storage.googleapis.com/dm-perception-test/zip_data/valid_videos.zip

Thank you for your quick reply! I have two questions I would like to ask.

  1. Does the perception test not require gpt assistance for evaluation?
  2. Why is the input in the model_vqa_videoperception.py different from other vqa inference evaluations? def get_model_option returns loss?

mary-0830 avatar May 08 '24 08:05 mary-0830

  1. yeah, it does not require gpt assistance
  2. we followed the official repo to implemement the evaluation, you can refer to the offical repo