Digant Desai comments

Results 94 comments of


                                            Digant Desai

Fix backends/arm test after _skip_dim_order update

cc @Gasoonjia - let's make sure these tests are also working for dim-order related stuff.

Support for transposed Conv ? ( ETA / Help with custom implementation )

Yes XNNPACK supports it, we haven't wired it up to ExecuTorch yet. Both XNNPACK and Portable variants will be added in the long run, but as Stephen said we are...

Add fp16 qb4w scalar kernels

Thanks!

what's the meaning of "Groupwise 4-bit (128)"

In the case of the LLama2 Linear operation, the weights are quantized. There are various methods to perform quantization. In this instance, we utilized "Symmetric, per channel groupwise" quantization to...

Support of Fused Quantized Operators

> Request to enable a simple way to support fused quantized operators Not sure it this fits with the existing PT2 quant flow. Can you do such fusion post partitioning...

Support of Fused Quantized Operators

Can we close this?+

ArmQuantizer: quantize dropout with SharedQuantizationSpec

rebase please? There is a merge conflict, thanks.

For Apple silicon, use machdep.cpu.brand_string in preference to decoding hw.machine

Thanks. LGTM. Sorry for the delay.

QB4W MLAL GEMM Kernels

@GregoryComer if merged can we close this?

CoreML Partitioner is not able to lower mv3

Do we have something close to this in CI? Like a quantizer variant perhaps?