Renato Barros Arantes comments

Results 7 comments of


                                            Renato Barros Arantes

Bug: Missing function register_module_for_export

Hi @KeijiBranshi, Yes, when I say TensorFlow 2.16 I mean tf-nightly. Thanks!

test_shuffle fails on aarch64 when BF16 data type is enabled.

Hi @kawakami-k, Sorry for the delay in responding to you. It is similar: ``` ./test_shuffle --gtest_filter="ShuffleSimpleBF16/shuffle_test_t.TestsShuffle/4" Note: Google Test filter = ShuffleSimpleBF16/shuffle_test_t.TestsShuffle/4:-*_GPU* [==========] Running 1 test from 1 test suite....

src:cpu:aarch64 restricts the conv operation to just one post-op when…

Hi @mgouicem The root cause is that OneDNN conducts all fp16 post-op operations in fp32 and then downgrades to fp16 at the end, leading to a rounding error. To bypass...

src:cpu:aarch64 restricts the conv operation to just one post-op when…

Hi @mgouicem, > This is the expected behavior in oneDNN, and this is exactly what benchdnn checks. I just confirmed that these two tests pass on x64 implementation, which has...

src:cpu:aarch64 restricts the conv operation to just one post-op when…

Hi @mgouicem, > Could you clarify how this remove the f32->f16 downconversion? I would expect that f32->f16 conversion to be here independently of the number of post-ops since convolution accumulates...

Pytorch quantization bias is not quantised on aarch64

Hi @shu1chen, Your answer is not related to my question that is about Pytorch, and not an example that you said I mentioned, but I did not. Maybe you are...

Enable external brgemm API on aarch64

Now that the API is enabled, I expected the example `examples/ukernels/cpu_brgemm.cpp` to be functional. Still, when running it after enabling `-DDNNL_EXPERIMENTAL_UKERNEL=ON`, it returns the error `Kernel is not supported on...