qwen2-vl topics

maestro

2.7k Stars, 220 Forks, 2.7k Watchers

Streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL

roboflow

cross-modal

gpt-4

gpt-4-vision

instance-segmentation

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi...

modelscope

agent

aigc

baichuan

chatglm

PaddleMIX

708 Stars, 223 Forks, 708 Watchers

Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high per...

PaddlePaddle

aigc

blip2

clip

coca

dify-with-qwen-vl

66 Stars, 11 Forks, 66 Watchers

Video understanding: Qwen video multimodal model & Dify

soulteary

dify

qwen2

qwen2-vl

Qwen-VL-Series-Finetune

1.5k Stars, 190 Forks, 1.5k Watchers

An open-source implementation for fine-tuning the Qwen-VL series by Alibaba Cloud.

2U1

chatbot

multimodal

qwen2-vl

vision-language

grps_trtllm

160 Stars, 11 Forks, 160 Watchers

Higher performance OpenAI LLM service than vLLM serve: A pure C++ high-performance OpenAI LLM service implemented with GRPS+TensorRT-LLM+Tokenizers.cpp, supporting chat and function call, AI agents, d...

NetEase-Media

ai-agent

chatglm

function-call

llama-index