PKU-YUAN-Lab (袁粒课题组-北大信工)
PKU-YUAN-Lab (袁粒课题组-北大信工)
Video-Bench
A Comprehensive Benchmark and Toolkit for Evaluating Video-based Large Language Models!
Chat-UniVi
[CVPR 2024 Highlight🔥] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding
Hallucination-Attack
Attack to induce LLMs within hallucinations
MoE-LLaVA
Mixture-of-Experts for Large Vision-Language Models
LanguageBind
【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
Video-LLaVA
【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
TaxDiff
The official code for "TaxDiff: Taxonomic-Guided Diffusion Model for Protein Sequence Generation"
MagicTime
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators