large-vision-models topic
Awesome_Matching_Pretraining_Transfering
The Paper List of Large Multi-Modality Model (Perception, Generation, Unification), Parameter-Efficient Finetuning, Vision-Language Pretraining, Conventional Image-Text Matching for Preliminary Insigh...
safe-sora
SafeSora is a human preference dataset designed to support safety alignment research in the text-to-video generation field, aiming to enhance the helpfulness and harmlessness of Large Vision Models (L...
awesome-vision-time-series
This is an official repository for "Harnessing Vision Models for Time Series Analysis: A Survey".
Awesome-MLLM-Uncertainty
✨A curated list of papers on the uncertainty in multi-modal large language model (MLLM).
Neural-Brain-for-Embodied-Agents
Project Page for Paper "Neural Brain: A Neuroscience-inspired Framework for Embodied Agents".
DGMR
The official implementation of "Diversity-Guided MLP Reduction for Efficient Large Vision Transformers"