TensorRT-LLM
TensorRT-LLM copied to clipboard
fix up qkv.bias error when use qwen1.5-32b-gptq-int4