[CVPR2024 Highlight] GLEE: General Object Foundation Model for Images and Videos at Scale
FoundationVision
[CVPR2023 Highlight] GRES: Generalized Referring Expression Segmentation
henghuiding