DV Lab

63 repositories owned by DV Lab

Video-P2P

339
Stars
23
Forks
Watchers

Video-P2P: Video Editing with Cross-attention Control

Prompt-Highlighter

123
Stars
2
Forks
Watchers

[CVPR 2024] Prompt Highlighter: Interactive Control for Multi-Modal LLMs

MOOD

133
Stars
5
Forks
Watchers

Official PyTorch implementation of MOOD series: (1) MOODv1: Rethinking Out-of-distribution Detection: Masked Image Modeling Is All You Need. (2) MOODv2: Masked Image Modeling for Out-of-Distribution...

GroupContrast

28
Stars
1
Forks
Watchers

[CVPR 2024] GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D Understanding

LLaMA-VID

541
Stars
33
Forks
Watchers

Official Implementation for LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models

LLMGA

449
Stars
29
Forks
Watchers

This project is the official implementation of 'LLMGA: Multimodal Large Language Model based Generation Assistant' (ECCV 2024 Oral)

Mask-Attention-Free-Transformer

55
Stars
3
Forks
Watchers

Official Implementation for "Mask-Attention-Free Transformer for 3D Instance Segmentation"

MGM

3.2k
Stars
279
Forks
25
Watchers

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

MR-GSM8K

25
Stars
0
Forks
Watchers

Challenge LLMs to Reason About Reasoning: A Benchmark to Unveil Cognitive Depth in LLMs

spconv-plus

146
Stars
7
Forks
Watchers