Koji Nakajima

Results 32 comments of Koji Nakajima

I quickly ran DGEMM NN cases. It fails with GlobalReadVectorWidth=1 and VectorWidth=2 and ThreadSeparateGlobalReadB=0.

Thanks for your update. With your latest change, dgemm NN + ThreadSeparateGlobalReadB tests passed, but dgemm TN + ThreadSeparateGlobalReadA still failed.

Also, please merge the latest change in develop branch.

Thank you for your update. Please add some test cases for ThreadSeparateGlobalReadA as well.

Your test case failed in CI test. I think you need to add the following line as other precheckin tests to avoid failing with other architectures. TestParameters: marks: [skip-gfx900, skip-gfx906,...

Thanks for your update. Please resolve the conflicts so that CI tests can run. By the way, it seems like you have not added Test parameters to skip unsupported architectures...

CI test failed again. Please update your test case. WIthout skip, it will not pass.

It looks like precheckin test is failing due to this change. Please check the log.

I quickly tried it but it caused the following error with aldebaran_Cijk_Alik_Bjlk_HHS_BH_GB.yaml. ==== Remove Unused Kernels ===================== Reading: tmp2/aldebaran_Cijk_Alik_Bjlk_HHS_BH_GB.yaml Traceback (most recent call last): File "RemoveUnusedKernels.py", line 82, in main()...