XNNPACK
High-efficiency floating-point neural network inference operators for mobile, server, and Web
Internal script change
F16-RMAX microkernel using AVX512 FP16 arithmetic - Add build support for avx512fp16
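For context, an RMAX microkernel computes the maximum over a vector of elements, e.g. as the first pass of a numerically stable softmax. Below is a minimal scalar sketch of that contract, assuming a toolchain with the `_Float16` type; it is illustrative only, and XNNPACK's actual microkernel uses AVX512 FP16 vector intrinsics (e.g. `_mm512_max_ph`) and a different signature:

```c
#include <stddef.h>

// Illustrative scalar sketch of an f16 rmax reduction: writes the maximum
// of `batch` half-precision elements to `*output`. An AVX512 FP16 variant
// performs the same reduction 32 lanes at a time with _mm512_max_ph.
static void f16_rmax_scalar(size_t batch, const _Float16* input, _Float16* output) {
  _Float16 vmax = input[0];
  for (size_t i = 1; i < batch; i++) {
    if (input[i] > vmax) {
      vmax = input[i];
    }
  }
  *output = vmax;
}
```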
Add the default condition missing from `xnnpack_aggregate_library`, which caused the TensorFlow build to fail on s390x
`x8-packw-x16c4` calls `x32-packw-x16`
Switch to the new `rational_9_6` microkernels for `f32-vtanh` on `x86` and `x86_64`.
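The `rational_9_6` name refers to approximating tanh with a rational function, a degree-9 polynomial divided by a degree-6 polynomial, evaluated with a handful of multiply-adds instead of calling `tanhf`. The sketch below uses the classic [7/6] Padé coefficients from the continued-fraction expansion of tanh, not XNNPACK's actual minimax-fitted coefficients, so treat it as an illustration of the technique only:

```c
// Sketch: tanh(x) ~= p(x) / q(x) with an odd numerator and even denominator.
// Coefficients are the [7/6] Pade approximant of tanh; real kernels use
// higher-degree minimax coefficients and first clamp |x| to the range
// where the approximation saturates to +/-1 within float precision.
static float tanh_rational_sketch(float x) {
  const float x2 = x * x;
  // p(x) = x * (135135 + 17325*x^2 + 378*x^4 + x^6)
  const float p = x * (135135.0f + x2 * (17325.0f + x2 * (378.0f + x2)));
  // q(x) = 135135 + 62370*x^2 + 3150*x^4 + 28*x^6
  const float q = 135135.0f + x2 * (62370.0f + x2 * (3150.0f + 28.0f * x2));
  return p / q;
}
```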
Add partial support for building, testing, and benchmarking XNNPACK on Hexagon. Additional work is needed to get this fully working in the Bazel build (notably, connecting to a Qualcomm SDK)...
Enable subconv path for DQ TransposeConv
AVX512FP16 - Add compiler flag guard around FP16 code
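A common way to apply such a guard (a sketch; XNNPACK's actual guard macros may differ): GCC and Clang define `__AVX512FP16__` only when the target enables the ISA (e.g. via `-mavx512fp16`), so the intrinsics path can be compiled conditionally with a scalar fallback. The example assumes a toolchain with `_Float16` and, for brevity, a length that is a multiple of 32:

```c
#include <stddef.h>
#if defined(__AVX512FP16__)
  #include <immintrin.h>
#endif

// Sketch of a compiler-flag guard: the AVX512 FP16 intrinsics are only
// compiled when __AVX512FP16__ is defined, so the file still builds for
// targets without the ISA.
static void scale_f16(const _Float16* input, _Float16* output, size_t n, _Float16 c) {
#if defined(__AVX512FP16__)
  const __m512h vc = _mm512_set1_ph(c);
  for (size_t i = 0; i < n; i += 32) {
    _mm512_storeu_ph(output + i, _mm512_mul_ph(_mm512_loadu_ph(input + i), vc));
  }
#else
  for (size_t i = 0; i < n; i++) {
    output[i] = input[i] * c;  // Scalar fallback path.
  }
#endif
}
```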
Add generic F16 and F32 Mean operator and subgraph support
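A mean reduction averages elements along given axes. Below is a minimal sketch of the f32 case over the innermost axis; the shapes and names are illustrative only, and XNNPACK's operator/subgraph API accepts arbitrary reduction axes:

```c
#include <stddef.h>

// Illustrative sketch: reduce a [rows, cols] f32 tensor to [rows] by
// averaging along the innermost axis. An f16 variant would typically
// convert to f32, accumulate, then convert back (an assumption here,
// but a common practice for half-precision reductions).
static void mean_f32_innermost(const float* input, float* output,
                               size_t rows, size_t cols) {
  for (size_t r = 0; r < rows; r++) {
    float acc = 0.0f;
    for (size_t c = 0; c < cols; c++) {
      acc += input[r * cols + c];
    }
    output[r] = acc / (float) cols;
  }
}
```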