ipex-llm icon indicating copy to clipboard operation
ipex-llm copied to clipboard

On A770,vllm and llama.cpp which brings better performance for MiniCPM-v-2.6 ?

Open yangqing-yq opened this issue 1 year ago • 5 comments

Any data table for benchmark?

yangqing-yq avatar Sep 02 '24 02:09 yangqing-yq

Currently, our VLLM does not support multimodal models. Support for multimodal models is ongoing in the 0.5.x version of VLLM. We will notify you once it's ready.

liu-shaojun avatar Sep 02 '24 03:09 liu-shaojun

Is there a schedule when we could get a vllm supported version to run MiniCPM-v ?

yangqing-yq avatar Sep 02 '24 05:09 yangqing-yq

Is there a schedule when we could get a vllm supported version to run MiniCPM-v ?

hi, the upgrade of IPEX-LLM vLLM to 0.5.x is in progress, we will let you know once it's ready, and we will also try to support MiniCPM-v in the new version.

glorysdj avatar Sep 02 '24 07:09 glorysdj

Check progress, how is the status of vllm supporting MLLM like MiniCPM-V-2.6? @glorysdj

yangqing-yq avatar Sep 30 '24 01:09 yangqing-yq

the upgrade of IPEX-LLM vLLM to 0.5.4 is finished, and MiniCPM-V-2.6 is supported, please refer to

https://github.com/intel-analytics/ipex-llm/tree/main/python/llm/example/GPU/vLLM-Serving#image-input

glorysdj avatar Sep 30 '24 01:09 glorysdj