qwen-vl topic

List qwen-vl repositories

PaddleMIX

345
Stars
128
Forks
Watchers

Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high per...

awesome-vlm-architectures

393
Stars
22
Forks
Watchers

Famous Vision Language Models and Their Architectures

webmarker

30
Stars
3
Forks
Watchers

Mark web pages for use with vision-language models

lmms-finetune

166
Stars
21
Forks
Watchers

A minimal codebase for finetuning large multimodal models, supporting llava-1.5/1.6, llava-interleave, llava-next-video, llava-onevision, qwen-vl, qwen2-vl, phi3-v etc.