Jonathan Deakin
I am getting a very mysterious error. While using a different library ([AESlib.h](https://github.com/DavyLandman/AESLib)), I got a seg fault when calling one particular function (`aes_cbc_enc`). At first I thought it...
# Description Adds class `acl_post_ops_t` which enables Compute Library for the Arm® Architecture (ACL) based primitives to have an arbitrary number and type of post ops by composing `acl_binary_t` and...
# Description This PR adds eltwise post ops to the Compute Library for the Arm® architecture (ACL) batch normalization primitive. ReLU (including leaky and bounded) are fused into the bnorm...
# Summary `test_concurrency` seg faults intermittently (~1 in 50), for example by running ``` ctest --repeat-until-fail 200 -R concurrency ``` which outputs ``` ... The following tests FAILED: 82 -...
Hi, first I want to say thank you for creating this package! I was looking to use it to solve the complex-valued Helmholtz equation; however, I can't find any...
In the README you recommend integrating complex functions as vectors with PAIRED error. Would you be averse to having a simple Julia wrapper which accepts functions which return complex values?...
# Description Fuse the sum post op in ACL matmul by setting the accumulate flag to true in `arm_compute::GEMMInfo`. This speeds up the post op and saves allocating a temporary...
# Description This PR adds an s8:s8:f32 matmul implementation using `arm_compute::NEGEMMLowpMatrixMultiplyCore`. For moderately sized problems, the new implementation is - several orders of magnitude faster than `gemm:jit` - ~3-4x faster...
# Summary `test_benchdnn_modeC_rnn_ci_cpu` is failing on AArch64, both with and without ACL, because some inputs return unimplemented (see [test_benchdnn_modeC_rnn_ci_cpu.unimplemented.log](https://github.com/oneapi-src/oneDNN/files/14699605/test_benchdnn_modeC_rnn_ci_cpu.unimplemented.log)). The only failing cases have ``` tag=abc:abdec:abc tag=bac:abdec:bac ``` and there are...