XNNPACK

High-efficiency floating-point neural network inference operators for mobile, server, and Web

342 XNNPACK issues

Fix a crash on internal benchmark with relaxedsimd

QS8 AVX2 broadcast reorder input and weight loads before conversions

Call xnnpack transpose from TfLite transpose and remove old optimized implementation

Generate neondot qc4w benchmarks. kr is in bytes.

Rename xnn_qd8_f32_qc4w_gemm_minmax_ukernel_fn and xnn_qd8_f32_qc8w_gemm_minmax_ukernel_fn

QS8 scalar GEMM template support unrolled microkernels - Unroll WASM by 4