FoundationVision

Results 6 repositories owned by FoundationVision

GLEE

1.0k
Stars
82
Forks
23
Watchers

[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale

UniRef

233
Stars
14
Forks
Watchers

[ICCV2023] Segment Every Reference Object in Spatial and Temporal Spaces

VAR

6.3k
Stars
422
Forks
121
Watchers

[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simp...

GenerateU

128
Stars
6
Forks
Watchers

[CVPR2024] Generative Region-Language Pretraining for Open-Ended Object Detection

Groma

543
Stars
57
Forks
Watchers

[ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization

LlamaGen

1.2k
Stars
45
Forks
Watchers

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation