The effectiveness of duo_scaling

Open Skyseaee opened this issue 5 months ago • 0 comments

I have been experimenting with duo_scaling in AutoAWQ, using the formula scales = (act_scales.pow(alpha) / w_scales.pow(1 - alpha)).clamp(min=1e-4).to(device).to(dtype). However, I observed a 1% decrease in accuracy when testing on the TextVQA dataset. I am looking for references or details on why the scale is computed as activation scale / weight scale rather than from the activation scale alone, so that I can better understand this decline. Thanks in advance for any answers.
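To make the comparison concrete, here is a minimal plain-Python sketch of the two variants being discussed. It is an illustration only and not AutoAWQ's implementation: the function names activation_only_scales and duo_scales are hypothetical, the inputs are made-up per-channel values, and the .to(device).to(dtype) step from the formula is omitted because no tensors are involved.

```python
# Hypothetical helpers for illustration; these are not AutoAWQ functions.

def activation_only_scales(act_scales, alpha, min_scale=1e-4):
    """Per-channel scale from activation statistics only: act^alpha, clamped below."""
    return [max(a ** alpha, min_scale) for a in act_scales]


def duo_scales(act_scales, w_scales, alpha, min_scale=1e-4):
    """Per-channel scale from the formula in the post: act^alpha / w^(1 - alpha), clamped below.

    Assumes every weight scale is strictly positive; a zero weight scale
    with alpha < 1 would raise ZeroDivisionError in this sketch.
    """
    return [
        max((a ** alpha) / (w ** (1 - alpha)), min_scale)
        for a, w in zip(act_scales, w_scales)
    ]


if __name__ == "__main__":
    # Made-up per-channel statistics, purely for demonstration.
    act_scales = [0.5, 2.0, 8.0]
    w_scales = [0.1, 0.1, 0.4]
    alpha = 0.5

    print("activation only:", activation_only_scales(act_scales, alpha))
    print("duo scaling:    ", duo_scales(act_scales, w_scales, alpha))
```

In this sketch the two variants coincide when alpha is 1, since the weight term becomes w^0 = 1. For smaller alpha, channels with small weight magnitude receive a larger scale under the duo formula than under the activation-only one.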

Sep 12 '24 10:09 Skyseaee
