Tensile icon indicating copy to clipboard operation
Tensile copied to clipboard

Async DMA support for Transpose Data Layout

Open ramjana opened this issue 2 years ago • 7 comments

Support to move directly to LDS support for Transpose Data layout Added feature constraints for GLVW>DWORD

ramjana avatar May 24 '22 20:05 ramjana

I quickly ran DGEMM NN cases. It fails with GlobalReadVectorWidth=1 and VectorWidth=2 and ThreadSeparateGlobalReadB=0.

nakajee avatar May 24 '22 20:05 nakajee

Thanks for your update. With your latest change, dgemm NN + ThreadSeparateGlobalReadB tests passed, but dgemm TN + ThreadSeparateGlobalReadA still failed.

nakajee avatar May 26 '22 19:05 nakajee

Also, please merge the latest change in develop branch.

nakajee avatar May 26 '22 19:05 nakajee

Thank you for your update. Please add some test cases for ThreadSeparateGlobalReadA as well.

nakajee avatar Jun 08 '22 15:06 nakajee

Your test case failed in CI test. I think you need to add the following line as other precheckin tests to avoid failing with other architectures.

TestParameters: marks: [skip-gfx900, skip-gfx906, skip-gfx908, skip-gfx1010, skip-gfx1011, skip-gfx1012, skip-gfx1030] # not supported by arch

nakajee avatar Jun 08 '22 19:06 nakajee

Thanks for your update. Please resolve the conflicts so that CI tests can run. By the way, it seems like you have not added Test parameters to skip unsupported architectures (skip-gfx???). Some of CI tests on unsupported arch will fail without this line.

nakajee avatar Jun 15 '22 14:06 nakajee

CI test failed again. Please update your test case. WIthout skip, it will not pass.

nakajee avatar Jun 17 '22 14:06 nakajee

Move to #1574

nakajee avatar Aug 25 '22 16:08 nakajee