multimodal-llm topic

List multimodal-llm repositories

vllm-safety-benchmark

63
Stars
2
Forks

[ECCV 2024] Official PyTorch Implementation of "How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs"

Ant-Multi-Modal-Framework

113
Stars
5
Forks

Research Code for Multimodal-Cognition Team in Ant Group

MiniGPT-5

845
Stars
52
Forks

Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens"

MineDreamer

68
Stars
4
Forks

This repo is the official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Control"

FireRedASR

1.6k
Stars
138
Forks
1.6k
Watchers

Open-source, industrial-grade ASR models supporting Mandarin, Chinese dialects, and English, achieving a new SOTA on public Mandarin ASR benchmarks while also offering outstanding singing lyrics recogn...