multimodal-llm topic

List multimodal-llm repositories

vllm-safety-benchmark

63 stars, 2 forks

[ECCV 2024] Official PyTorch Implementation of "How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs"

Ant-Multi-Modal-Framework

113 stars, 5 forks

Research Code for Multimodal-Cognition Team in Ant Group

MiniGPT-5

845 stars, 52 forks

Official implementation of the paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens"

MineDreamer

68 stars, 4 forks

This repo is the official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Control"