OpenGVLab

Results 21 repositories owned by


                                            OpenGVLab

all-seeing

449

Stars

Forks

Watchers

[ICLR 2024 & ECCV 2024] The All-Seeing Projects: Towards Panoptic Visual Recognition&Understanding and General Relation Comprehension of the Open World"

OpenGVLab

all-seeing

dataset

region-text

VideoMAEv2

490

Stars

Forks

Watchers

[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking

OpenGVLab

action-detection

action-recognition

cvpr2023

foundation-model

LORIS

Stars

Forks

Watchers

Long-Term Rhythmic Video Soundtracker, ICML2023

OpenGVLab

aigc

diffusion-models

multi-modality

music-generation

Awesome-DragGAN

Stars

Forks

Watchers

Awesome-DragGAN: A curated list of papers, tutorials, repositories related to DragGAN

OpenGVLab

awesome-list

draggan

gan

PonderV2

300

Stars

Forks

Watchers

PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training Paradigm

OpenGVLab

3d-vision

foundation-models

pretraining

DDPS

Stars

Forks

Watchers

Official Implementation of "Denoising Diffusion Semantic Segmentation with Mask Prior Modeling"

OpenGVLab

diffusion-models

mask-prior-modeling

semantic-segmentation

OmniQuant

595

Stars

Forks

Watchers

[ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.

OpenGVLab

large-language-models

llm

quantization

InternVL-MMDetSeg

Stars

Forks

Watchers

Train InternViT-6B in MMSegmentation and MMDetection with DeepSpeed

OpenGVLab

object-detection

semantic-segmentation

vision-foundation

MM-NIAH

Stars

Forks

Watchers

This is the official implementation of the paper "Needle In A Multimodal Haystack"

OpenGVLab

benchmark

long-context

multimodal-large-language-models

vision-language-model

PIIP

105

Stars

Forks

105

Watchers

[NeurIPS 2024 Spotlight ⭐️ & TPAMI 2025] Parameter-Inverted Image Pyramid Networks (PIIP)

OpenGVLab

computer-vision

image-classification

instance-segmentation

multimodal-large-language-models