Bruce Lai

Results 21 comments of Bruce Lai

Hi @KaustubhIMG It seems that LMUL=2 codes have disappeared. Have you ever faced output our-of-tolerance error? ![image](https://github.com/google/XNNPACK/assets/55973122/e95fee5a-61e1-4f2b-998b-b5f214e90788)

oh. Do you mean that you didn't face our-of-tolerance error for LMUL=1 in your environment?

Hi @fbarchard This is the RVV IGEMM implementation and NR is also determined by LMUL & VLEN. The main idea is similar to https://github.com/google/XNNPACK/pull/5893.

Hi Frank, This PR now has several conflicts since `tools/generate-gemm-test.py` has been refactored. Let's try to approve and merge PR https://github.com/google/XNNPACK/pull/5893 at first. After that, I'll refactor this PR.

RVV IGEMM test cases has been updated.

Hi @fbarchard @Maratyszcza Do you have any suggestions about this Op?

@Maratyszcza This error only happens in clang. Can we add a patch to bypass `SET(CMAKE_C_STANDARD 99)` setting when using clang in both `XNNPACK` & `cpuinfo`? You can also check it...

@alankelly @fbarchard Could you help to review it? Also, I would like to ask about what's the appropriate way to enable RVV-only nr2 selection logic in following files. ``` src/operators/convolution-nhwc.c...

Hi @fbarchard @alankelly Could you help to merge this PR?

@fbarchard Thanks for your help.