OpenGVLab
all-seeing
[ICLR 2024 & ECCV 2024] The All-Seeing Projects: Towards Panoptic Visual Recognition & Understanding and General Relation Comprehension of the Open World
VideoMAEv2
[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
LORIS
Long-Term Rhythmic Video Soundtracker, ICML 2023
Awesome-DragGAN
Awesome-DragGAN: A curated list of papers, tutorials, repositories related to DragGAN
PonderV2
PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training Paradigm
DDPS
Official Implementation of "Denoising Diffusion Semantic Segmentation with Mask Prior Modeling"
OmniQuant
[ICLR 2024 Spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.
InternVL-MMDetSeg
Train InternViT-6B in MMSegmentation and MMDetection with DeepSpeed
MM-NIAH
This is the official implementation of the paper "Needle In A Multimodal Haystack"
PIIP
[NeurIPS 2024 Spotlight ⭐️ & TPAMI 2025] Parameter-Inverted Image Pyramid Networks (PIIP)