Digant Desai
Digant Desai
cc @Gasoonjia - let's make sure these tests are also working for dim-order related stuff.
Yes XNNPACK supports it, we haven't wired it up to ExecuTorch yet. Both XNNPACK and Portable variants will be added in the long run, but as Stephen said we are...
Thanks!
In the case of the LLama2 Linear operation, the weights are quantized. There are various methods to perform quantization. In this instance, we utilized "Symmetric, per channel groupwise" quantization to...
> Request to enable a simple way to support fused quantized operators Not sure it this fits with the existing PT2 quant flow. Can you do such fusion post partitioning...
Can we close this?+
rebase please? There is a merge conflict, thanks.
Thanks. LGTM. Sorry for the delay.
@GregoryComer if merged can we close this?
Do we have something close to this in CI? Like a quantizer variant perhaps?