thinking-with-image topic

List thinking-with-image repositories

Stars

Forks

Watchers

ACTIVE-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO

Stars

Forks

Watchers

Official implementation of "Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology"