ipex-llm

OOM on multiple-ARC with vllm serving

Open · jessie-zhao opened this issue 7 months ago · 1 comment

Running the vLLM serving test on Arc fails with the error below:

    INFO 07-04 19:10:08 async_llm_engine.py:152] Aborted request cmpl-e5fb5cad96e9402dabbbece3611ae22f-0.
    INFO: 127.0.0.1:41772 - "POST /v1/completions HTTP/1.1" 500 Internal Server Error
    ERROR: Exception in ASGI application
    Traceback (most recent call last):
      File "/usr/local/lib/python3.11/dist-packages/uvicorn/protocols/http/httptools_impl.py", line 399, in run_asgi
        result = await app(  # type: ignore[func-returns-value]
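For anyone trying to reproduce this, the failing call in the log is a `POST /v1/completions` against the OpenAI-compatible server. Below is a minimal client sketch using only the Python standard library. The base URL, port, model name, and prompt are placeholders I made up for illustration; they are not taken from the original test setup, so substitute whatever the server was actually started with.

```python
import json
import urllib.error
import urllib.request


def build_completion_request(base_url, model, prompt, max_tokens=32):
    """Build a POST /v1/completions request (OpenAI-compatible schema)."""
    payload = {"model": model, "prompt": prompt, "max_tokens": max_tokens}
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/v1/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def send(request, timeout=60):
    """Send the request and return (status_code, body_text).

    HTTP errors such as the 500 in the log are returned, not raised,
    so the response body can be inspected.
    """
    try:
        with urllib.request.urlopen(request, timeout=timeout) as resp:
            return resp.status, resp.read().decode("utf-8")
    except urllib.error.HTTPError as err:
        return err.code, err.read().decode("utf-8")


# Hypothetical usage; URL and model name are assumptions:
# req = build_completion_request("http://127.0.0.1:8000", "Qwen1.5-32B-Chat", "Hello")
# status, body = send(req)
# print(status, body)
```

Sending several of these requests concurrently is probably closer to the serving test than a single call, since the failure shows up under load rather than at startup.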

Model: ChatGLM3-6B / Qwen1.5-14B on 2 Arc GPUs; Qwen1.5-32B on 4 Arc GPUs

HW: Xeon W + Arc
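As a rough sanity check on the configurations above, here is a back-of-the-envelope sketch of how much memory the model weights alone would need per GPU. It assumes the weights are split evenly across the cards (tensor parallel) and ignores the KV cache, activations, and runtime overhead, which all add to the total. The bytes-per-parameter values are assumptions; the issue does not say which precision or quantization was used.

```python
def weight_gib_per_gpu(params_billion, bytes_per_param, num_gpus):
    """Approximate weight memory per GPU in GiB, assuming an even split."""
    total_bytes = params_billion * 1e9 * bytes_per_param
    return total_bytes / num_gpus / 2**30


# Configurations listed in the issue; precision values are hypothetical.
configs = [
    ("ChatGLM3-6B", 6, 2),
    ("Qwen1.5-14B", 14, 2),
    ("Qwen1.5-32B", 32, 4),
]

for name, params_b, gpus in configs:
    for label, nbytes in (("fp16", 2.0), ("4-bit", 0.5)):
        gib = weight_gib_per_gpu(params_b, nbytes, gpus)
        print(f"{name} on {gpus} GPUs, {label}: ~{gib:.1f} GiB of weights per GPU")
```

Comparing these numbers against the memory of each card, plus whatever the server reserves for the KV cache, gives a first hint about whether the OOM is expected for a given setup.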

jessie-zhao · Jul 11 '24 07:07