ComputeLibrary
The Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologies.
Hi, I'm extremely interested in speeding up int8 `MatMul` inference with an Arm Compute Library kernel. My model is: ```mermaid graph TD; Input1["Input out: fp32"] Quantise1["NEQuantizationLayer out: signed int8"] Input2["Input...
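For context on what an int8 `MatMul` computes, here is a minimal scalar reference for asymmetric-quantized matrix multiplication (the arithmetic that ACL's low-precision GEMM kernels accelerate). This is a sketch, not ACL code: the function name and signature are illustrative, and it uses the standard affine quantization convention `real = scale * (q - zero_point)` with int32 accumulation followed by requantization.

```cpp
#include <algorithm>
#include <cmath>
#include <cstdint>
#include <vector>

// Reference int8 matmul with asymmetric quantization (QASYMM8_SIGNED-style):
// real = scale * (q - zero_point). Products are accumulated in int32, then
// the accumulator is requantized to the destination scale/zero-point.
std::vector<int8_t> matmul_s8(const std::vector<int8_t>& a, int32_t a_zp, float a_scale,
                              const std::vector<int8_t>& b, int32_t b_zp, float b_scale,
                              int M, int K, int N, int32_t d_zp, float d_scale)
{
    std::vector<int8_t> dst(M * N);
    const float requant = (a_scale * b_scale) / d_scale; // combined rescale factor
    for (int m = 0; m < M; ++m)
        for (int n = 0; n < N; ++n)
        {
            int32_t acc = 0; // int32 accumulator avoids int8 overflow
            for (int k = 0; k < K; ++k)
                acc += (int32_t(a[m * K + k]) - a_zp) * (int32_t(b[k * N + n]) - b_zp);
            const int32_t q = int32_t(std::lround(acc * requant)) + d_zp;
            dst[m * N + n] = int8_t(std::clamp<int32_t>(q, -128, 127)); // saturate to int8
        }
    return dst;
}
```

A fast kernel performs the same computation but hoists the zero-point corrections out of the inner loop and vectorises the int8 dot products.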
**Output of 'strings libarm_compute.so | grep arm_compute_version':** arm_compute_version=v24.04 Build options: {'Werror': '1', 'build_dir': '//acl/build', 'debug': '0', 'neon': '1', 'opencl': '0', 'os': 'linux', 'openmp': '1', 'cppthreads': '0', 'arch': 'armv8.2-a', 'multi_isa': '1',...
With GCC 15, the build fails due to a missing `cstdint` include. GCC 15 removed some transitive includes from the standard headers, which causes the issue. See https://gcc.gnu.org/gcc-15/porting_to.html for more information....
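The failure mode and fix are easy to demonstrate: code that uses fixed-width integer types but relies on another standard header to pull in `<cstdint>` transitively stops compiling under GCC 15. A minimal sketch (the function is illustrative, not from the library):

```cpp
// Before GCC 15, headers like <string> often pulled in <cstdint>
// transitively, so uint8_t/int64_t "just worked". GCC 15 trimmed those
// transitive includes; the fix is to include <cstdint> explicitly in every
// translation unit that uses fixed-width types.
#include <cstdint>

uint8_t clamp_to_u8(int64_t v)
{
    if (v < 0)
        return 0;
    if (v > UINT8_MAX)
        return UINT8_MAX;
    return static_cast<uint8_t>(v);
}
```

The same explicit-include rule applies to `UINT8_MAX` and friends, which also live in `<cstdint>`.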
**Output of 'strings libarm_compute.so | grep arm_compute_version':** **Platform: Raspberry Pi 5** **Operating System: Raspberry Pi Bookworm OS** **GCC version:** g++ -v Using built-in specs. COLLECT_GCC=g++ COLLECT_LTO_WRAPPER=/usr/lib/gcc/arm-linux-gnueabihf/12/lto-wrapper Target: arm-linux-gnueabihf Configured with:...
Hi, I am a bit of a beginner here and I need some help with this. **Output of 'strings libarm_compute.so | grep arm_compute_version':** **Platform:** Raspberry Pi 5 (ARM Cortex-A76) **Operating...
Hello, I am considering using the SVE instruction set to optimize GEMM operators. I found that although the repository has the relevant code, there is no example showing me how to...
Hello, I'm trying to make sense of the code, and stumbled onto [this piece of code](https://github.com/ARM-software/ComputeLibrary/blob/de7288cb71e6b9190f52e50a44ed68c309e4a041/src/cpu/kernels/CpuIm2ColKernel.cpp#L309): ```c++ _convolved_dims = scaled_dimensions(src->dimension(width_idx), dst->dimension(height_idx), _kernel_width, _kernel_height, _conv_info, _dilation); ``` Shouldn't it be `src->dimension(height_idx)`?...
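For reference, `scaled_dimensions` computes the standard convolution output-size formula per spatial axis, so both spatial arguments should describe the source tensor. A self-contained sketch of that formula (the helper below is illustrative, not the ACL function itself, and assumes symmetric padding and FLOOR rounding):

```cpp
#include <utility>

// Standard convolution output-size formula (FLOOR rounding) applied to each
// spatial axis:
//   out = (in + 2 * pad - dilated_kernel) / stride + 1
// where dilated_kernel = (kernel - 1) * dilation + 1.
std::pair<int, int> conv_out_dims(int in_w, int in_h, int kernel_w, int kernel_h,
                                  int stride_w, int stride_h, int pad_w, int pad_h,
                                  int dil_w = 1, int dil_h = 1)
{
    const int eff_kw = (kernel_w - 1) * dil_w + 1; // effective (dilated) kernel width
    const int eff_kh = (kernel_h - 1) * dil_h + 1; // effective (dilated) kernel height
    const int out_w  = (in_w + 2 * pad_w - eff_kw) / stride_w + 1;
    const int out_h  = (in_h + 2 * pad_h - eff_kh) / stride_h + 1;
    return {out_w, out_h};
}
```

Since the height term in this formula must be the input's spatial height, passing a `dst` dimension where an `src` dimension is expected would indeed be suspect, which is the point of the question above.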
Version: main, v24.08 **Platform: armv7a** **Operating System: Android** **Problem description:** ComputeLibrary won't build with neon=1 arch=armv7a os=android using Android NDK r27b (LTS, the latest at the time of writing) or r26d (previous LTS). Build...
I can't find the mobilenet_v2_1.0_224.tgz to get the network description, weights etc.
Is it possible to support F32 dequantized output for `QASYMM8` / `QASYMM8_SIGNED` inputs in NEConvolutionLayer / NEGEMMConvolutionLayer?