XNNPACK
High-efficiency floating-point neural network inference operators for mobile, server, and Web
Should we refer to one of the publications listed in README?
BF16 GEMM microkernels
Add kernel_elements as a parameter to depthwise convolution microkernels. This will allow us to change depthwise convolution microkernels to support kernel elements up to the primary tile size, instead of only...
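The idea above can be sketched as a scalar microkernel whose inner loop runs over a runtime kernel-element count rather than a hard-coded tile size. This is a minimal illustration; the function name, signature, and weight layout are assumptions, not the actual XNNPACK microkernel ABI.

```c
#include <stddef.h>

// Hypothetical sketch: a scalar depthwise-convolution microkernel that takes
// the number of kernel elements as a runtime parameter (kernel_elements),
// so any count up to the primary tile size can be handled by one kernel.
void dwconv_scalar(
    size_t channels,          // channels processed for this output pixel
    size_t kernel_elements,   // runtime kernel-element count (e.g. 9 for 3x3)
    const float** input,      // kernel_elements pointers to input pixels
    const float* weights,     // assumed layout: [kernel_elements][channels]
    float* output)            // [channels] results
{
  for (size_t c = 0; c < channels; c++) {
    float acc = 0.0f;
    // Accumulate only over the requested kernel elements instead of a
    // compile-time tile size.
    for (size_t k = 0; k < kernel_elements; k++) {
      acc += input[k][c] * weights[k * channels + c];
    }
    output[c] = acc;
  }
}
```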
Add fused operators support to convolution operators. A specified set of operators can be fused into convolution using the new function xnn_create_convolution2d_nhwc_f32_fused; this creates a convolution operator with a list...
WIP pipe fused params through subgraph and operators
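The fusion described above can be illustrated with a toy kernel where a follow-up operator (a min/max clamp, standing in for an activation) is applied in-register inside the convolution's output loop, so it costs no extra pass over memory. The xnn_create_convolution2d_nhwc_f32_fused entry point mentioned above is the WIP API; the helper below and its signature are purely illustrative.

```c
#include <stddef.h>

// Toy 1D valid convolution with a fused clamp epilogue. Fusing the clamp
// into the output loop avoids a second read-modify-write pass over the
// output tensor. Hypothetical sketch, not XNNPACK's actual operator code.
void conv1d_valid_fused_clamp(
    size_t input_len, size_t kernel_len,
    const float* input, const float* kernel,
    float output_min, float output_max,   // fused clamp parameters
    float* output)                        // length: input_len - kernel_len + 1
{
  for (size_t i = 0; i + kernel_len <= input_len; i++) {
    float acc = 0.0f;
    for (size_t k = 0; k < kernel_len; k++) {
      acc += input[i + k] * kernel[k];
    }
    // Fused epilogue: clamp in-register before the single store.
    if (acc < output_min) acc = output_min;
    if (acc > output_max) acc = output_max;
    output[i] = acc;
  }
}
```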
Add eager API for transpose.
Variable-size transpose ukernels are treated as 1D ukernels, but they are inherently 2D ukernels; the element size contains one dimension. When the input or output dimension is strided, then adding element...
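A minimal sketch of the 2D view described above: a rows-by-cols transpose of elements of `element_size` bytes, with independent input and output row strides so strided tensors are handled; the element size effectively adds a contiguous inner dimension. The signature is an assumption for illustration, not the XNNPACK ukernel ABI.

```c
#include <stddef.h>
#include <string.h>

// 2D transpose over opaque elements of element_size bytes.
// input_stride / output_stride are the byte distances between rows,
// which need not equal cols * element_size (i.e. strided tensors work).
void transpose_2d_memcpy(
    size_t rows, size_t cols, size_t element_size,
    const char* input, size_t input_stride,   // bytes between input rows
    char* output, size_t output_stride)       // bytes between output rows
{
  for (size_t i = 0; i < rows; i++) {
    for (size_t j = 0; j < cols; j++) {
      // Element (i, j) of the input lands at (j, i) of the output.
      memcpy(output + j * output_stride + i * element_size,
             input + i * input_stride + j * element_size,
             element_size);
    }
  }
}
```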
SpaceToDepth xnnpack delegate
Add SpaceToDepth to XNNPACK
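For reference, SpaceToDepth moves each block_size x block_size spatial block into the channel dimension, turning an NHWC image of shape [H, W, C] into [H/b, W/b, C*b*b]. The sketch below follows the common TFLite layout convention (sub-pixels appended in row-major (dy, dx) order); the helper name and signature are assumptions, not the actual delegate code.

```c
#include <stddef.h>

// Reference SpaceToDepth for a single NHWC image:
// [height, width, channels] -> [height/b, width/b, channels*b*b].
// Assumes height and width are divisible by block_size.
void space_to_depth_nhwc(
    size_t height, size_t width, size_t channels, size_t block_size,
    const float* input, float* output)
{
  const size_t out_width = width / block_size;
  const size_t out_channels = channels * block_size * block_size;
  for (size_t oy = 0; oy < height / block_size; oy++) {
    for (size_t ox = 0; ox < out_width; ox++) {
      for (size_t dy = 0; dy < block_size; dy++) {
        for (size_t dx = 0; dx < block_size; dx++) {
          for (size_t c = 0; c < channels; c++) {
            const size_t iy = oy * block_size + dy;
            const size_t ix = ox * block_size + dx;
            // Channels of the (dy, dx) sub-pixel are appended in order.
            output[(oy * out_width + ox) * out_channels
                   + (dy * block_size + dx) * channels + c] =
                input[(iy * width + ix) * channels + c];
          }
        }
      }
    }
  }
}
```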
Add AVX & AVX2 transpose microkernel generator