oneDNN
oneDNN copied to clipboard
oneAPI Deep Neural Network Library (oneDNN)
# Summary I have benchmarked various standard deep learning networks such as AlexNet, GoogleNet, ResNet50, and MobileNet-V2. I have observed that OneDNN v2.6 performs slower compared with v1.4. # Version...
# Description A link to the rendered document: [Link](https://github.com/igorsafo/oneDNN/tree/rfcs/rfcs/20220201-quantization-scaling)
# Description Adds class `acl_post_ops_t` which enables Compute Library for the Arm® Architecture (ACL) based primitives to have an arbitrary number and type of post ops by composing `acl_binary_t` and...
# Description This PR adds eltwise post ops to the Compute Library for the Arm® architecture (ACL) batch normalization primitive. ReLU (including leaky and bounded) are fused into the bnorm...
# Summary I have benchmarked LSTM layer using OneDNN for v1.4 and v2.6. I have observed that OneDNN v2.6 performs 1.5x slower compared with v1.4. # Version v2.6 # Environment...
This proposal aims to introduce a set of graph API into oneDNN. Rendered version: [link](https://github.com/TaoLv/mkl-dnn/blob/lvtao/rfcs/graph-api/rfcs/20220711-graph-api/README.md) cc @jianhui-li @igorsafo @mgouicem
[Link to rendered document](https://github.com/maayaneh/oneDNN/blob/bit_exact_conv_rfc/rfcs/20220630-bit-exact-conv/README.md)
# Description This PR extends binary SYCL kernel support for non-uniform group sizes. This includes a new logic for work-item config in kernel launch and handling the trailing portions of...
The root cause seems to be [this commit](https://github.com/oneapi-src/oneDNN/commit/e0e46ccdaec02e8ed4b9606564a721c225f33960). Is there any reason why this is the behavior? It came as a surprise to me because it wasn't specified in the...
## Context I'm trying to use OneDNN to implement arbitrary axis permutation for a tensor, e.g., ``` t = [[0, 1], [2, 3]] t.transpose(); // == t.transpose({1, 0}) ``` And...