John Osorio comments

Results 13 comments of John Osorio

Detectnet for several classes

Hi @elaith9, I was reading your posts, and a lot of other posts. If someone has already solved this problem, please help me. They are very interesting, but I'm having a...

Detectnet for several classes

@elaith9 thanks for your answer. @dqthebt24 I was trying with detectnet, but I suppose that I didn't use enough epochs. I'm going to test again.

gpu: nvidia: conv: Fix int8 convolution primitive fails

We have updated the PR. We follow the equation to apply the destination scaling after the post-operations. As `cudnnConvolutionForward` does not support having its output in `s32` format, we follow...
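As a rough illustration of why the order matters, here is a minimal sketch that applies a `sum` post-operation first and the destination scale last. The helper names, the float accumulator, and the convention that the destination scale is applied as a division are all assumptions made for this example; they are not taken from the PR, and sum scales and zero points are ignored.

```python
def saturate_s8(x: float) -> int:
    """Round to nearest (ties to even) and clamp to the signed 8-bit range."""
    return max(-128, min(127, round(x)))


def int8_conv_output(acc, src_scale, wei_scale, dst_prev, dst_scale):
    """Hypothetical helper: post-ops first, destination scale last.

    acc      -- raw convolution accumulator, held as a float (assumption)
    dst_prev -- value already in dst, added by the `sum` post-op
    """
    # Dequantize the accumulator with the source and weights scales.
    f32 = acc * src_scale * wei_scale
    # Apply the `sum` post-op on the dequantized value.
    f32 = f32 + dst_prev
    # Only now apply the destination scale (assumed to be a division),
    # then saturate to s8.
    return saturate_s8(f32 / dst_scale)


def int8_conv_output_wrong_order(acc, src_scale, wei_scale, dst_prev, dst_scale):
    """Same inputs, but scaling dst before the post-op, for comparison."""
    f32 = acc * src_scale * wei_scale / dst_scale
    return saturate_s8(f32 + dst_prev)
```

With `acc=100`, `src_scale=0.5`, `wei_scale=1.0`, `dst_prev=10.0`, and `dst_scale=2.0`, the first function yields 30 while the second yields 35, so the two orders are not interchangeable.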

gpu: nvidia: conv: Fix int8 convolution primitive fails

> There is another class of test cases that fail: no dst scale, but dst is s8: `./build/tests/benchdnn/benchdnn --conv --engine=gpu --skip-impl=ref --dir=FWD_I --dt=s8:s8:s8 --attr-scales=src:common:2 --attr-post-ops=sum mb1ic512iw121oc512ow122kw6pw3nconv1d:21` > > Even dst...

gpu: nvidia: conv: Fix int8 convolution primitive fails

@dzarukin I did the squash as you mentioned. Thank you very much. Let me know if something else needs to be done.

Add support for different src and dst datatypes in the SYCL implementation for pooling

@dzarukin We took into account what was discussed during the meeting. The PR now avoids applying pooling to the src:dst combinations u8:u8 and s8:s8.

Add support for different src and dst datatypes in the SYCL implementation for pooling

> > @dzarukin We take into account what was discussed during the meeting. The PR now avoids applying pooling src:dst (u8:u8, s8:s8). > > Could you clarify what issue does...

Add support for different src and dst datatypes in the SYCL implementation for pooling

> > @mgouicem We received feedback mentioning that we need to avoid the use of `u8:u8` and `s8:s8`. Maybe @dzarukin has any additional clue about it. If this combination is...

Add support for different src and dst datatypes in the SYCL implementation for pooling

> > > > @mgouicem We received feedback mentioning that we need to avoid the use of `u8:u8` and `s8:s8`. Maybe @dzarukin has any additional clue about it. If this...

gpu: generic: sycl: lnorm Intel GPU precision issues

To give a little additional context: I found that in this [line](https://github.com/oneapi-src/oneDNN/blob/08fd0eee29cc523ae0f4905c148513c43e7e948d/src/gpu/generic/sycl/layer_normalizations_kernels.hpp#L218), the division is making **v_variance** slightly different from the reference that is computed in benchdnn (there...
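To make this kind of discrepancy concrete, the sketch below computes a variance in emulated single precision two ways: dividing by the element count, and multiplying by a precomputed reciprocal. This is only an illustration of how two mathematically equal formulations can differ in the last bits; it is an assumption for the example, not a description of what the kernel or the benchdnn reference actually does. All function names are hypothetical.

```python
import struct


def f32(x: float) -> float:
    """Round a Python float (f64) to the nearest f32 value."""
    return struct.unpack("f", struct.pack("f", x))[0]


def sum_sq_dev(values):
    """Sum of squared deviations from the mean, rounding every step to f32."""
    n = f32(len(values))
    total = 0.0
    for v in values:
        total = f32(total + f32(v))
    mean = f32(total / n)
    acc = 0.0
    for v in values:
        d = f32(f32(v) - mean)
        acc = f32(acc + f32(d * d))
    return acc


def variance_by_division(values):
    # Final step is a true division by the element count.
    return f32(sum_sq_dev(values) / f32(len(values)))


def variance_by_reciprocal(values):
    # Final step multiplies by a reciprocal that was itself rounded to f32.
    inv_n = f32(1.0 / len(values))
    return f32(sum_sq_dev(values) * inv_n)


if __name__ == "__main__":
    data = [0.1, 0.7, 1.3, 2.9, 3.3, 4.1, 5.7]
    a = variance_by_division(data)
    b = variance_by_reciprocal(data)
    # The two results agree to within a few f32 ulps; they may or may not
    # be bit-identical depending on the input.
    print(a, b, a == b)
```

Any difference between the two results is on the order of a unit in the last place, which is small but can matter when a test compares against a reference with a tight tolerance.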