
> [@ZhouJYu](https://github.com/ZhouJYu) I tested locally and didn't encounter any errors; the results came out normally. Have you tried running the cases mentioned in the README?
>
> ```
> This image shows a two-wheeled motor vehicle parked outdoors on a tiled surface, with a soft suitcase next to the motorcycle. The motorcycle is parked very close to, and touching, a yellow pavement marking. The number "106" is painted in white on the ground, indicating that it marks the parking spot. There are some green flowers and shrubs around, suggesting a place with frequent daily activity, possibly a residential area. Part of a building is visible in the background, possibly a residential tower. ...
> ```

> Local inference fails with `ImportError: cannot import name 'Qwen2_5_VLForConditionalGeneration' from 'transformers' (/root/miniconda3/envs/llama_factory/lib/python3.11/site-packages/transformers/__init__.py)`
>
> transformers version:
>
> ```
> root@vllm-dev:/home/aigc_worker/aigc/vllm# pip show transformers
> Name: transformers
> Version: 4.48.2
> Summary: State-of-the-art Machine Learning for JAX, PyTorch and TensorFlow ...
> ```
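For reference, this `ImportError` usually means the installed transformers predates Qwen2.5-VL support. A minimal version check, assuming support landed in transformers 4.49.0:

```python
# Sketch of a version guard, assuming Qwen2_5_VLForConditionalGeneration
# was added in transformers 4.49.0; upgrade with e.g.
#   pip install -U "transformers>=4.49.0"
import transformers
from packaging import version

print(transformers.__version__)
if version.parse(transformers.__version__) < version.parse("4.49.0"):
    raise RuntimeError("transformers too old for Qwen2_5_VLForConditionalGeneration")

from transformers import Qwen2_5_VLForConditionalGeneration  # should import now
```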

> > It is recommended to use the video path directly. You can use `file://YOUR/VIDEO/PATH` to set the `video_url`, such as `file://datasets/videos/video_1.mp4`. You also need to set `--allowed-local-media-path /` ...
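For completeness, a minimal client-side sketch under those assumptions; the server address, model name, and video path are illustrative, and the server is assumed to have been started with `--allowed-local-media-path /` so `file://` URLs are readable:

```python
# Sketch: send a local video to a vLLM OpenAI-compatible server via file:// URL.
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")
response = client.chat.completions.create(
    model="Qwen/Qwen3-VL-8B-Instruct",  # assumed model name
    messages=[{
        "role": "user",
        "content": [
            # vLLM accepts a video_url content part; the path below is illustrative
            {"type": "video_url", "video_url": {"url": "file://datasets/videos/video_1.mp4"}},
            {"type": "text", "text": "Describe this video."},
        ],
    }],
)
print(response.choices[0].message.content)
```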

Hello, the Instruct model does not include a "think" process, and `model.generate` produces non-streaming output. Therefore, the time you wait for `generate` is actually the total time required to generate...
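If you want to see tokens as they are produced rather than waiting for `generate` to return, a minimal streaming sketch with `TextIteratorStreamer`; the checkpoint name is illustrative, and a text-only prompt is used to keep the example short:

```python
# Sketch: stream model.generate output token-by-token instead of blocking.
from threading import Thread

from transformers import AutoModelForImageTextToText, AutoProcessor, TextIteratorStreamer

model_id = "Qwen/Qwen3-VL-8B-Instruct"  # assumed checkpoint name
model = AutoModelForImageTextToText.from_pretrained(model_id, torch_dtype="auto", device_map="auto")
processor = AutoProcessor.from_pretrained(model_id)

inputs = processor(text="Describe a motorcycle.", return_tensors="pt").to(model.device)
streamer = TextIteratorStreamer(processor.tokenizer, skip_prompt=True, skip_special_tokens=True)

# generate() blocks until all tokens are produced, so run it in a thread
# and consume the streamer incrementally on the main thread.
thread = Thread(target=model.generate, kwargs=dict(**inputs, streamer=streamer, max_new_tokens=128))
thread.start()
for text_chunk in streamer:
    print(text_chunk, end="", flush=True)
thread.join()
```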

> oops, sorry about that! 😢 you can just replace `Qwen3VLForConditionalGeneration` with `Qwen3VLMoeForConditionalGeneration` for MoE models! I will update it ASAP!

@JJJYmmm Perhaps we could use `AutoModelForImageTextToText` to cover both...
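A sketch of that suggestion, with illustrative checkpoint names: the Auto class reads the checkpoint config and dispatches to the right concrete class, so one code path covers both variants.

```python
# Sketch: AutoModelForImageTextToText dispatches per-checkpoint, so the same
# loading code works for dense and MoE variants. Model names are illustrative.
from transformers import AutoModelForImageTextToText

dense = AutoModelForImageTextToText.from_pretrained(
    "Qwen/Qwen3-VL-8B-Instruct",         # resolves to Qwen3VLForConditionalGeneration
    torch_dtype="auto", device_map="auto",
)
moe = AutoModelForImageTextToText.from_pretrained(
    "Qwen/Qwen3-VL-235B-A22B-Instruct",  # resolves to Qwen3VLMoeForConditionalGeneration
    torch_dtype="auto", device_map="auto",
)
```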

@JepsonWong Because transformers and vLLM use different operator implementations, there is some numerical fluctuation, so the final results may differ slightly. If convenient, please send me your image so I can check whether the diff is within expectations.

@zekunhao1995 FYI: https://github.com/QwenLM/Qwen3-VL/issues/1643#issuecomment-3424492396

What is your hardware configuration? The launch command looks fine; you could try fp8 or smaller weights.
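For the fp8 route, a sketch based on the README launch command; `Qwen/Qwen3-VL-235B-A22B-Instruct-FP8` is an assumption about the exact checkpoint id:

```
python -m sglang.launch_server \
    --model-path Qwen/Qwen3-VL-235B-A22B-Instruct-FP8 \
    --host 0.0.0.0 \
    --port 22002 \
    --tp 8
```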

> [@wulipc](https://github.com/wulipc) It's 8×H800; in theory that should be enough, right?

I suggest first trying the command from the README, and then troubleshooting from there:

```
python -m sglang.launch_server \
    --model-path Qwen/Qwen3-VL-235B-A22B-Instruct \
    --host 0.0.0.0 \
    --port 22002 \
    --tp 8
```

Could you share the exact error message? Alternatively, you can try llama.cpp: https://github.com/ggml-org/llama.cpp/pull/16780