ShiningMaker
First, thank you for your excellent work on this quantization library! I'm encountering two critical issues when deploying a quantized Qwen3-8B model to vLLM 0.9.1: - The initial deployment failed...
**Describe the bug** My dmesg output shows that the GPTQ Python process (PID 1179327) was killed by the kernel's OOM killer because the system ran out of memory...
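To confirm an OOM kill like this, the kernel log line of the form `Out of memory: Killed process <pid> (<name>)` can be parsed directly. Below is a minimal, hedged sketch (not part of the library) that scans dmesg-style text for such records; the sample log line is illustrative, not the actual output from this report.

```python
import re

def find_oom_kills(dmesg_text: str) -> list[tuple[int, str]]:
    """Return (pid, process_name) pairs for kernel OOM-killer kill records."""
    pattern = re.compile(r"Out of memory: Killed process (\d+) \(([^)]+)\)")
    return [(int(pid), name) for pid, name in pattern.findall(dmesg_text)]

# Illustrative dmesg excerpt (hypothetical formatting, real kernel message shape)
sample = "[12345.678901] Out of memory: Killed process 1179327 (python) total-vm:..."
print(find_oom_kills(sample))  # [(1179327, 'python')]
```

In practice one would feed this the output of `dmesg` (or `journalctl -k`) to verify which process the kernel terminated.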