OpenGVLab

Results 19 repositories owned by OpenGVLab

Ask-Anything

3.0k
Stars
247
Forks
14
Watchers

[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.

DragGAN

5.0k
Stars
492
Forks
63
Watchers

Unofficial Implementation of DragGAN - "Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold" (DragGAN 全功能实现,在线Demo,本地部署试用,代码、模型已全部开源,支持Wi...

InternGPT

3.2k
Stars
232
Forks
40
Watchers

InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing,...

GITM

595
Stars
19
Forks
Watchers

Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memory

InternImage

2.5k
Stars
231
Forks
Watchers

[CVPR 2023 Highlight] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions

InternVideo

1.3k
Stars
85
Forks
Watchers

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Multi-Modality-Arena

450
Stars
34
Forks
Watchers

Chatbot Arena meets multi-modality! Multi-Modality Arena allows you to benchmark vision-language models side-by-side while providing images as inputs. Supports MiniGPT-4, LLaMA-Adapter V2, LLaVA, BLIP...

Instruct2Act

323
Stars
20
Forks
Watchers

Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model

InternVL

5.8k
Stars
456
Forks
48
Watchers

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型