tangjinchuan
tangjinchuan
If you need to test it on a AMD 7800XT, please send me your compiled windows tunner files with shfl enabled for Navi. I have already posted the previously tunned...
Also, is this ready to roll?
Dear Cedric, I have put the tuning results based on my A770 card today for you to look at or update. https://github.com/CNugteren/CLBlast/issues/1#issuecomment-1959880869
I suggest @0cc4m give it a try to check urgent test cases.
I see. Please try installing intel compute runtime if this is not the case. The tuning is optimized for the Intel NEO Platform, not the open-source Mesa Platform. https://github.com/intel/compute-runtime The...
@0cc4m Please give the latest version [24.05.28454.6] a go: https://github.com/intel/compute-runtime/releases, all the install steps are there. If the problem persists try turning the PC on and off again for I...
Thanks! Will have the test case a try next day. In the meantime, for any windows users, you can also give it a go. [clblast_test_xgemm.zip](https://github.com/CNugteren/CLBlast/files/14397345/clblast_test_xgemm.zip)
@CNugteren Could you please have a look at the results? It only produced ":". Is this correct? I used openBLAS as the comparison library and the exe did not show...
@0cc4m I remembered one test update from 1.6.1 to 1.6.2 is here https://github.com/CNugteren/CLBlast/commit/afb3d8a604f0b2aec50aeb267372767d27389c99 If you were using 1.6.2 previously with newly tunned results, I guess this worths trying: Replacing the...
The Xe laptop used to compile the test cases also only produce one ":". I guess it is not a single problem for A770 only.