XNNPACK
High-efficiency floating-point neural network inference operators for mobile, server, and Web
We needed to adjust XNN_ALLOCATION_ALIGNMENT to 128 bytes for HVX and use a predicated store for the tail part. ctest passed, but performance is not good yet. The next step naturally...
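As a rough illustration of the two changes described above, here is a minimal sketch in C. The helper name `store_f32_tail` and the surrounding structure are hypothetical, not XNNPACK's actual code; the intrinsic names follow the Hexagon SDK's `hvx_hexagon_protos.h` and should be verified against your SDK version.

```c
#include <stddef.h>
#include <hexagon_types.h>
#include <hvx_hexagon_protos.h>

/* HVX vector registers are 128 bytes wide, hence the larger alignment. */
#define XNN_ALLOCATION_ALIGNMENT 128

/* Hypothetical tail handler: writes the final nbytes (< 128) of an
 * accumulator with a single predicated store instead of a scalar loop. */
static void store_f32_tail(float* output, HVX_Vector vacc, size_t nbytes) {
  /* Build a predicate with only the first `nbytes` byte lanes enabled. */
  HVX_VectorPred qtail = Q6_Q_vsetq_R((int) nbytes);
  /* Predicated store: lanes outside the predicate are left untouched. */
  Q6_vmaskedstoreq_QAV(qtail, (HVX_Vector*) output, vacc);
}
```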
While building MediaPipe with XNNPACK, an error occurs:

(base) sstc@sstc-B450MH:~/0506/mediapipe$ bazel build -c opt --define MEDIAPIPE_DISABLE_GPU=1 mediapipe/examples/desktop/pose_tracking:pose_tracking_cpu --verbose_failures
WARNING: /home/sstc/0506/mediapipe/mediapipe/framework/BUILD:69:24: in cc_library rule //mediapipe/framework:calculator_cc_proto: target '//mediapipe/framework:calculator_cc_proto' depends on deprecated target '@com_google_protobuf//:cc_wkt_protos':...
Fix caching of weights in `create_gemm_or_igemm` in `convolution-nhwc.cc`. Previously, the memory was reserved unconditionally.
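A minimal sketch of the conditional-reservation pattern this fix implies, checking the cache before reserving rather than reserving up front. All names here (`weights_cache`, `get_packed_weights`) are illustrative, not XNNPACK's actual cache API.

```c
#include <stddef.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical single-entry weights cache. */
struct weights_cache {
  void* data;   /* packed weights, or NULL if not yet cached */
  size_t size;
};

/* Return packed weights, reserving and packing only on a cache miss.
 * Before the fix, the reservation happened unconditionally, even when
 * packed weights were already available. */
static void* get_packed_weights(struct weights_cache* cache,
                                const void* weights, size_t size) {
  if (cache->data != NULL && cache->size == size) {
    return cache->data;              /* cache hit: no new reservation */
  }
  void* packed = malloc(size);       /* reserve only on a miss */
  if (packed == NULL) return NULL;
  memcpy(packed, weights, size);     /* stand-in for real weight packing */
  cache->data = packed;
  cache->size = size;
  return packed;
}
```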
I hit two new errors, so I updated the build recipe for Hexagon. 1. XNNPACK_ENABLE_RISCV_VECTOR=ON caused a compilation error, so I disabled this CMake variable. If you find any issue...
AVX512FP16 - add compiler flag guard around fp16 code
Enable AVX512FP16 vmul vbinary microkernels
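For context on these two commits, such a guard typically keys off the compiler's predefined macro, so fp16 code is only compiled when the corresponding `-mavx512fp16` flag is active. A minimal sketch, with an illustrative function name (`_mm512_mul_ph` is the standard AVX512FP16 multiply intrinsic):

```c
#if defined(__AVX512FP16__)
  #include <immintrin.h>

  /* fp16 multiply body; compiled only when -mavx512fp16 is enabled,
   * so builds without the flag do not fail on the __m512h type. */
  static __m512h example_f16_vmul(__m512h a, __m512h b) {
    return _mm512_mul_ph(a, b);
  }
#endif  /* __AVX512FP16__ */
```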
NEON MLAL QS8 RSUM accumulating microkernels
Scalar QS8 RSUM accumulating microkernels
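In the spirit of these two commits, a minimal scalar sketch of the accumulating QS8 RSUM pattern they share; the name and signature are illustrative, not XNNPACK's exact microkernel interface.

```c
#include <stddef.h>
#include <stdint.h>

/* Accumulating reduce-sum over signed 8-bit inputs: the result is
 * added to the existing value in *output rather than overwriting it,
 * which is what distinguishes the "accumulating" variant. */
static void qs8_rsum_acc(size_t n, const int8_t* input, int32_t* output) {
  int32_t vacc = *output;            /* start from the previous partial sum */
  for (size_t i = 0; i < n; i++) {
    vacc += (int32_t) input[i];      /* widen to 32 bits before adding */
  }
  *output = vacc;
}
```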
- Initial implementation and test added.
- xnnpack/intrinsics-polyfill.h has the horizontal sum code (Q6_f32_vrsum_Vsf) using vshuff and vadd.
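The note above says the actual polyfill uses vshuff + vadd; the sketch below achieves the same horizontal sum with an equivalent rotate-and-add reduction for brevity. It assumes HVX v68+ float support, and the function name is illustrative; intrinsic names are from the Hexagon SDK headers.

```c
#include <hexagon_types.h>
#include <hvx_hexagon_protos.h>

/* Horizontal sum of the 32 fp32 lanes in a 128-byte HVX vector:
 * rotate by half the remaining width and add, five times. */
static float hvx_rsum_f32(HVX_Vector v) {
  for (int offset = 64; offset >= 4; offset >>= 1) {
    HVX_Vector rot = Q6_V_vror_VR(v, offset);
    /* vadd on sf inputs yields qf32; convert back to sf each step. */
    v = Q6_Vsf_equals_Vqf32(Q6_Vqf32_vadd_VsfVsf(v, rot));
  }
  union { HVX_Vector vec; float lanes[32]; } u;
  u.vec = v;
  return u.lanes[0];  /* after 5 steps every lane holds the total */
}
```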
Fix `generate-enum.py` to use `#include "..."` instead of `#include <...>`.
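The two forms differ in how the preprocessor resolves the path: the quoted form searches the including file's directory and project include paths before falling back to the system search, which is what a generated project header needs. An illustration of the two emitted lines (header name hypothetical):

```c
#include "xnnpack/operator-type.h"  /* quoted form: project paths searched first */
#include <xnnpack/operator-type.h>  /* angle form: system include paths only */
```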