qwen-vl topic
List
qwen-vl repositories
PaddleMIX
345
Stars
128
Forks
Watchers
Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high per...
awesome-vlm-architectures
393
Stars
22
Forks
Watchers
Famous Vision Language Models and Their Architectures
webmarker
30
Stars
3
Forks
Watchers
Mark web pages for use with vision-language models
lmms-finetune
166
Stars
21
Forks
Watchers
A minimal codebase for finetuning large multimodal models, supporting llava-1.5/1.6, llava-interleave, llava-next-video, llava-onevision, qwen-vl, qwen2-vl, phi3-v etc.