dukelee111

Results 3 issues of dukelee111

Environment: Platform: 6548N+4ARC770 Docker Image: intelanalytics/ipex-llm-serving-xpu:2.1.0 servicing script: ![image](https://github.com/user-attachments/assets/3949f088-d83f-4844-9ab3-0f0c98604792) Error info: 1.With Dtype SYM_INT4 could succeed. 2.With Dtype FP8 failed with concurrency>=4. No error for concurrency 1 and 2. 2.GPU...

user issue
multi-arc

Please help to confirm if the GLM-4-9B-Chat is supported , thanks so much. Docker images:intelanalytics/ipex-llm-serving-vllm-xpu-experiment Tag:2.1.0b2 Image ID:0e20af44ad46 step: cd /benchmark/all-in-one edit config.yaml bash run-deepspeed-arc.sh Attached the error trace details:...

user issue

Environment: Platform: 6548N+1 ARC770 Docker Image: ![image](https://github.com/user-attachments/assets/c46ecf56-7dad-48b4-8c52-b67c2bb385ab) servicing script: ![image](https://github.com/user-attachments/assets/03ab8adf-7271-45af-b934-c619063a8a83) Error info: 1.With compression weight SYM_INT4 failed. 2.Has tried the parameter "gpu-memory-utilization" from 0.65 to 0.95 with step 0.05 could...

user issue
multi-arc