XNNPACK
High-efficiency floating-point neural network inference operators for mobile, server, and Web
Add F16C f16-f32acc rdsum microkernels
Enable `-mavx512fp16`, which is needed for the AVX512FP16 microkernels
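As a minimal illustration of what that flag gates: GCC and Clang define the `__AVX512FP16__` macro when `-mavx512fp16` is passed, so AVX512FP16 code can be guarded on it. The function below is a hypothetical placeholder, not XNNPACK's actual source layout.

```c
/* Minimal sketch: guard AVX512FP16 code on the macro that -mavx512fp16
 * defines, so the file compiles to nothing when the flag is absent.
 * example_avx512fp16_kernel is hypothetical, not an XNNPACK kernel. */
#if defined(__AVX512FP16__)
#include <immintrin.h>

void example_avx512fp16_kernel(void) {
  /* Native f16 arithmetic is available here, e.g. _mm512_add_ph(). */
}
#endif  /* __AVX512FP16__ */
```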
Add F16F32ACC AVX512SKX rdsum accumulating microkernels
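To make the "f16-f32acc" naming in the two rdsum entries above concrete, here is a minimal sketch of the underlying idea: load f16 inputs, widen them to f32 (here with the F16C `VCVTPH2PS` instruction), and keep the running sum in f32 so the reduction does not lose precision. The function name and loop structure are illustrative only and do not reproduce XNNPACK's generated kernels; compile with `-mf16c`.

```c
#include <immintrin.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical sketch (not XNNPACK's kernel): sum n IEEE f16 values into an
 * f32 accumulator, widening with F16C before accumulating ("f16-f32acc"). */
float f16_f32acc_rdsum_sketch(size_t n, const uint16_t* x) {
  __m256 vacc = _mm256_setzero_ps();
  for (; n >= 8; n -= 8) {
    /* Load 8 f16 values and widen them to f32 with VCVTPH2PS. */
    const __m256 vx = _mm256_cvtph_ps(_mm_loadu_si128((const __m128i*) x));
    x += 8;
    vacc = _mm256_add_ps(vacc, vx);  /* accumulate in f32, not f16 */
  }
  /* Horizontal reduction of the 8 f32 lanes. */
  __m128 vsum = _mm_add_ps(_mm256_castps256_ps128(vacc),
                           _mm256_extractf128_ps(vacc, 1));
  vsum = _mm_add_ps(vsum, _mm_movehl_ps(vsum, vsum));
  vsum = _mm_add_ss(vsum, _mm_movehdup_ps(vsum));
  float sum = _mm_cvtss_f32(vsum);
  /* Scalar tail for the remaining 0..7 elements. */
  for (; n != 0; n -= 1) {
    sum += _cvtsh_ss(*x++);
  }
  return sum;
}
```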
Add partial support for building/testing/benchmarking XNNPACK on Hexagon. Additional work would need to be done to get this fully working in the Bazel build (notably, connecting to a Qualcomm SDK)...
Roll back #6365 (enable F16-RMINMAX and F16-RMAX microkernels using AVX512 FP16 arithmetic); it breaks some internal tests.
Test `packing-test --gtest_filter="PACK_QD8_F32_QB4W_GEMM_GOI_W.*"`
Add AVX512FP16 vbinary microkernels: use native FP16 arithmetic on AVX512, via the vop and vopc templates, covering add, sub, mul, div, max, min, and sqrdiff
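As a hedged sketch of what one of these ops looks like in native f16 arithmetic, the function below computes sqrdiff, `out[i] = (a[i] - b[i])^2`, with AVX512FP16 intrinsics. It is illustrative only and does not reproduce XNNPACK's vop/vopc template structure; a real kernel would handle the tail with masked vector loads rather than a scalar loop. Requires `-mavx512fp16`.

```c
#include <immintrin.h>
#include <stddef.h>

/* Hypothetical sketch (not the generated XNNPACK template): an AVX512FP16
 * vbinary microkernel computing sqrdiff entirely in native f16 arithmetic.
 * A __m512h vector holds 32 half-precision lanes. */
void f16_vsqrdiff_sketch(size_t n, const _Float16* a, const _Float16* b,
                         _Float16* out) {
  for (; n >= 32; n -= 32) {
    const __m512h va = _mm512_loadu_ph(a);
    const __m512h vb = _mm512_loadu_ph(b);
    a += 32;
    b += 32;
    const __m512h vd = _mm512_sub_ph(va, vb);      /* difference in f16 */
    _mm512_storeu_ph(out, _mm512_mul_ph(vd, vd));  /* square in f16 */
    out += 32;
  }
  /* Scalar tail for the remaining 0..31 elements. */
  for (; n != 0; n -= 1) {
    const _Float16 d = *a++ - *b++;
    *out++ = d * d;
  }
}
```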
Add f16_vsqrdiff_test and f16_vsqrdiffc_test build targets. Fixes #6395
F32 and F16 are missing vbinary benchmarks. There are vunary benchmarks, which are generated from tests, and a few of the 8-bit ops do have vbinary benchmarks: qs8-vadd.cc, qs8-vaddc.cc, qs8-vmul.cc, qs8-vmulc.cc, qu8-vadd.cc, qu8-vaddc.cc, qu8-vmul.cc...
The :f16_vsqrdiff_test and :f16_vsqrdiffc_test targets are missing, but they do exist for F32.