VAR icon indicating copy to clipboard operation
VAR copied to clipboard

🌈 Introducing DiverseVAR, a training-free framework that unleashes the inherent generative diversity of Visual Autoregressive models while preserving fidelity and text–image alignment.

Open wangtong627 opened this issue 1 month ago • 0 comments

Thanks a lot to the inspiring progress in Visual Autoregressive (VAR) models!

We introduce DiverseVAR, a simple yet effective training-free framework that restores the lost generative diversity in VAR models.
By strategically manipulating the pivotal component at early scales, DiverseVAR significantly boosts diversity without harming fidelity or text-image alignment.
On both Infinity-2B and Infinity-8B, it consistently improves Recall, Coverage, and FID while keeping CLIP scores nearly unchanged.

Image

Arxiv: https://arxiv.org/abs/2511.17074
Github: https://github.com/wangtong627/DiverseVAR
Huggingface Daily Paper: https://huggingface.co/papers/2511.17074

wangtong627 avatar Nov 27 '25 09:11 wangtong627