John Osorio

Results: 13 comments by John Osorio

Hi @elaith9, I was reading your posts, and a lot of other posts. If someone has already solved this problem, please help me. They are very interesting. But I'm having a...

@elaith9 thanks for your answer. @dqthebt24 I was trying with detectnet, but I suppose that in fact I didn't use enough epochs. I'm going to test again.

We have updated the PR. We follow the equation to apply the destination scaling after the post-operations. As `cudnnConvolutionForward` does not support having its output in `s32` format, we follow...

> There is another class of test cases that fail: no dst scale, but dst is s8: `./build/tests/benchdnn/benchdnn --conv --engine=gpu --skip-impl=ref --dir=FWD_I --dt=s8:s8:s8 --attr-scales=src:common:2 --attr-post-ops=sum mb1ic512iw121oc512ow122kw6pw3nconv1d:21`
>
> Even dst...

@dzarukin I did the squash as you mentioned. Thank you very much. Let me know if something else needs to be done.

@dzarukin We take into account what was discussed during the meeting. The PR now avoids applying pooling src:dst (u8:u8, s8:s8).

> > @dzarukin We take into account what was discussed during the meeting. The PR now avoids applying pooling src:dst (u8:u8, s8:s8). > > Could you clarify what issue does...

> > @mgouicem We received feedback mentioning that we need to avoid the use of `u8:u8` and `s8:s8`. Maybe @dzarukin has an additional clue about it. If this combination is...

> > > > @mgouicem We received feedback mentioning that we need to avoid the use of `u8:u8` and `s8:s8`. Maybe @dzarukin has an additional clue about it. If this...

To give a little additional context: I found that in this [line](https://github.com/oneapi-src/oneDNN/blob/08fd0eee29cc523ae0f4905c148513c43e7e948d/src/gpu/generic/sycl/layer_normalizations_kernels.hpp#L218), the division is making **v_variance** slightly different from the reference that is computed in benchdnn (there...