VLMEvalKit
Request to add SIBench evaluation code
Added evaluation code for the SIBench paper, "How Far are VLMs from Visual Spatial Intelligence? A Benchmark-Driven Perspective" (https://arxiv.org/abs/2509.18905).
- Includes inference_mixed.py to support mixed inference for both images and videos.
- Includes SIBench.py for processing the SIBenchmark.
- Introduced a new MixedOutput format.
- Added post-processing support for the MixedOutput format in run.py.
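
For readers unfamiliar with mixed inference, here is a minimal sketch of one way samples might be routed by media type before inference. It is an illustration only: `IMAGE_EXTS`, `VIDEO_EXTS`, `media_kind`, and `split_by_kind` are hypothetical names and are not taken from inference_mixed.py, which may make this decision differently (for example, from dataset metadata rather than file extensions).

```python
from pathlib import Path

# Hypothetical extension sets (assumption); the actual code may use other criteria.
IMAGE_EXTS = {".jpg", ".jpeg", ".png", ".bmp", ".webp"}
VIDEO_EXTS = {".mp4", ".avi", ".mov", ".mkv", ".webm"}


def media_kind(path: str) -> str:
    """Classify a sample as 'image' or 'video' from its file extension."""
    ext = Path(path).suffix.lower()
    if ext in IMAGE_EXTS:
        return "image"
    if ext in VIDEO_EXTS:
        return "video"
    raise ValueError(f"unsupported media type: {path}")


def split_by_kind(paths):
    """Group sample paths so each group can go to the matching inference routine."""
    groups = {"image": [], "video": []}
    for p in paths:
        groups[media_kind(p)].append(p)
    return groups
```

Grouping up front keeps the image and video code paths separate, so each can apply its own preprocessing before the results are merged into a single output.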
Please fix the lint issue.
Thanks for the reminder. I have checked the code by running pre-commit run --all-files, as described in the development guide, and it now reports no formatting errors.