FoundationVision

Results 5 repositories owned by FoundationVision

GLEE

930
Stars
74
Forks
23
Watchers

[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale

UniRef

227
Stars
12
Forks
Watchers

[ICCV2023] Segment Every Reference Object in Spatial and Temporal Spaces

VAR

3.6k
Stars
276
Forks
106
Watchers

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly y...

GenerateU

99
Stars
5
Forks
Watchers

[CVPR2024] Generative Region-Language Pretraining for Open-Ended Object Detection

Groma

404
Stars
50
Forks
Watchers

Grounded Multimodal Large Language Model with Localized Visual Tokenization