ModelCloud.ai

Results: 1 repository owned by ModelCloud.ai

GPTQModel

Stars: 902 | Forks: 130 | Watchers: 902

LLM quantization (compression) toolkit with hardware acceleration support for Nvidia CUDA, AMD ROCm, Intel XPU, and Intel/AMD/Apple CPUs via HF, vLLM, and SGLang.