Visual Evaluation with Foundation Models

5 repositories owned by Visual Evaluation with Foundation Models

Q-Align

Stars: 159, Forks: 12

③[ICML2024] [IQA, IAA, VQA] All-in-one foundation model for visual scoring. Can be efficiently fine-tuned on downstream datasets.

Q-Bench

Stars: 191, Forks: 11

①[ICLR2024 Spotlight] (GPT-4V/Gemini-Pro/Qwen-VL-Plus + 16 open-source MLLMs) A benchmark for multi-modality LLMs (MLLMs) on low-level vision and visual quality assessment.

Q-Instruct

Stars: 163, Forks: 8

②[CVPR 2024] Low-level visual instruction tuning, with a 200K dataset and a model zoo for fine-tuned checkpoints.

Co-Instruct

Stars: 45, Forks: 3

④[Comparison among Multiple Images!] A study on open-ended multi-image quality comparison: a dataset, a model and a benchmark.

A-Bench

Stars: 113, Forks: 2

[LMM + AIGC] What do we expect from LMMs as AIGI evaluators and how do they perform?