lyg95

Results 4 comments of lyg95
trafficstars

I have tested AWQ and GPTQ on llama-7b, but I do not see a speedup mentioned in the paper. It's true that GPTQ without act-order faster than with act-order. test...

The CPU info:Intel(R) Xeon(R) Gold 6230N CPU @ 2.30GHz When I tested, I use numactl to bind a single CPU core. Of course, I provide the code for my test...

Thank you for your suggestions. For the first two points, I will test it again. For the third point, the test script I provided may have misled you. When I...

Thanks for your reply. I conducted a more detailed analysis of the quantization model. I find that the aforementioned problem is not caused by the awq algorithm. It should be...