multimodal-large-language-models topic

Repositories tagged multimodal-large-language-models

modelscope-agent

2.7k
Stars
308
Forks

ModelScope-Agent: An agent framework connecting models in ModelScope with the world

Awesome-LLMs-meet-Multimodal-Generation

322
Stars
17
Forks

🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).

MileBench

23
Stars
1
Forks

This repo contains evaluation code for the paper "MileBench: Benchmarking MLLMs in Long Context"

FreeVA

43
Stars
0
Forks

FreeVA: Offline MLLM as Training-Free Video Assistant

SLAM-LLM

508
Stars
42
Forks

Speech, Language, Audio, Music Processing with Large Language Model

matryoshka-mm

68
Stars
4
Forks

Matryoshka Multimodal Models

DenseFusion

105
Stars
1
Forks

DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception

GAMA

67
Stars
6
Forks

Code for the paper: GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities

OceanGPT

25
Stars
2
Forks

[ACL 2024] OceanGPT: A Large Language Model for Ocean Science Tasks