qwen2-vl topic

List qwen2-vl repositories

maestro

2.6k
Stars
219
Forks
2.6k
Watchers

Streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL

ms-swift

3.6k
Stars
310
Forks
Watchers

Use PEFT or full-parameter training to fine-tune 350+ LLMs or 90+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Visio...

PaddleMIX

345
Stars
128
Forks
Watchers

Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high per...

dify-with-qwen-vl

65
Stars
11
Forks
65
Watchers

Video understanding: Qwen video multimodal model & Dify

Qwen2-VL-Finetune

50
Stars
5
Forks
Watchers

An open-source implementation for fine-tuning the Qwen2-VL series by Alibaba Cloud.

grps_trtllm

157
Stars
11
Forks
157
Watchers

Higher-performance OpenAI LLM service than vLLM serve: a pure C++ high-performance OpenAI LLM service implemented with GRPS+TensorRT-LLM+Tokenizers.cpp, supporting chat and function call, AI agents, d...

illufly

74
Stars
8
Forks
74
Watchers

✨🦋 illufly (幻蝶): a self-evolving agent based on memory distillation and document retrieval

drivebench

217
Stars
13
Forks
217
Watchers

[ICCV 2025] Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives

qwen-ai-provider

29
Stars
14
Forks
29
Watchers

Community-built Qwen AI Provider for Vercel AI SDK - Integrate Alibaba Cloud's Qwen models with Vercel's AI application framework

wd-llm-caption-cli

39
Stars
11
Forks
39
Watchers

A Python-based CLI tool for captioning images with WD series, Joy-caption-pre-alpha, Meta Llama 3.2 Vision Instruct, and Qwen2 VL Instruct models.