tangjinchuan

Results 39 comments of tangjinchuan

If you need to test it on a AMD 7800XT, please send me your compiled windows tunner files with shfl enabled for Navi. I have already posted the previously tunned...

Dear Cedric, I have put the tuning results based on my A770 card today for you to look at or update. https://github.com/CNugteren/CLBlast/issues/1#issuecomment-1959880869

I suggest @0cc4m give it a try to check urgent test cases.

I see. Please try installing intel compute runtime if this is not the case. The tuning is optimized for the Intel NEO Platform, not the open-source Mesa Platform. https://github.com/intel/compute-runtime The...

@0cc4m Please give the latest version [24.05.28454.6] a go: https://github.com/intel/compute-runtime/releases, all the install steps are there. If the problem persists try turning the PC on and off again for I...

Thanks! Will have the test case a try next day. In the meantime, for any windows users, you can also give it a go. [clblast_test_xgemm.zip](https://github.com/CNugteren/CLBlast/files/14397345/clblast_test_xgemm.zip)

@CNugteren Could you please have a look at the results? It only produced ":". Is this correct? I used openBLAS as the comparison library and the exe did not show...

@0cc4m I remembered one test update from 1.6.1 to 1.6.2 is here https://github.com/CNugteren/CLBlast/commit/afb3d8a604f0b2aec50aeb267372767d27389c99 If you were using 1.6.2 previously with newly tunned results, I guess this worths trying: Replacing the...

The Xe laptop used to compile the test cases also only produce one ":". I guess it is not a single problem for A770 only.