We are working towards a future that one foundation model can be a multi-purpose expert for low-level visual perception and visual evaluation.
Visual Evaluation with Foundation Models