VILA
VILA copied to clipboard
About perception testset
Hello authors, Thanks for sharing fantastic jobs. Now I would like to ask where this dataset came from, can you share a link or data? "/lustre/fsw/portfolios/nvr/projects/nvr_elm_llm/dataset/video_datasets_v2/perception_test/"
https://github.com/google-deepmind/perception_test
Specifically https://storage.googleapis.com/dm-perception-test/zip_data/valid_videos.zip
https://github.com/google-deepmind/perception_test
Specifically https://storage.googleapis.com/dm-perception-test/zip_data/valid_videos.zip
Thank you for your quick reply! I have two questions I would like to ask.
- Does the perception test not require gpt assistance for evaluation?
- Why is the input in the model_vqa_videoperception.py different from other vqa inference evaluations? def get_model_option returns loss?
- yeah, it does not require gpt assistance
- we followed the official repo to implemement the evaluation, you can refer to the offical repo