LightCompress
Will you support two-stage quantization of weights or activations, as in Qserve?
Hi, thank you for your great work! Since fine-grained quantization has a significant impact on the results, would it be possible to support an algorithm similar to the two-stage weight quantization in Qserve?
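To make the request concrete, here is a rough sketch of what a two-stage weight quantizer could look like. It is not Qserve's or LightCompress's actual implementation. It assumes the first stage is per-channel symmetric quantization to 8-bit integers and the second stage is per-group asymmetric quantization of those integers to 4-bit codes with an integer group scale. The names `quantize_two_stage`, `dequantize_two_stage`, and `group_size` are hypothetical and chosen only for illustration.

```python
def quantize_two_stage(row, group_size=4):
    """Quantize one output channel (a list of floats) in two stages.

    Illustrative sketch only; names and layout are assumptions.
    """
    # Stage 1: per-channel symmetric quantization to signed 8-bit integers.
    max_abs = max(abs(w) for w in row) or 1.0
    scale_1 = max_abs / 127.0
    q8 = [round(w / scale_1) for w in row]

    # Stage 2: per-group asymmetric quantization of the 8-bit integers
    # to 4-bit codes in [0, 15], using an integer scale per group.
    groups = []
    for start in range(0, len(q8), group_size):
        chunk = q8[start:start + group_size]
        lo, hi = min(chunk), max(chunk)
        scale_2 = max(1, -(-(hi - lo) // 15))  # integer ceiling division
        q4 = [min(15, max(0, round((v - lo) / scale_2))) for v in chunk]
        groups.append((q4, scale_2, lo))  # lo acts as the group offset
    return scale_1, groups


def dequantize_two_stage(scale_1, groups):
    """Undo stage 2 (back to 8-bit integers), then stage 1 (back to floats)."""
    out = []
    for q4, scale_2, offset in groups:
        out.extend((q * scale_2 + offset) * scale_1 for q in q4)
    return out


row = [0.5, -1.0, 0.25, 0.75, 0.1, -0.2, 0.3, 0.0]
scale_1, groups = quantize_two_stage(row, group_size=4)
restored = dequantize_two_stage(scale_1, groups)
```

In this sketch only the per-channel scale is a float; the group-level parameters are small integers, which is the property that makes this kind of scheme attractive for fine-grained quantization.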
Qserve will be supported in the future.