Aquavanjaram942_80cu BF16 and HHS NN TN NT GEMM tuning
- BF16 NN, TN ,NT BF16 grid points improvement. ~14% improvement done.
hipBLAS-test results
[----------] 1 test from ExtOpTest/ExtOpAMaxUnsupportedDatatypeTest [ RUN ] ExtOpTest/ExtOpAMaxUnsupportedDatatypeTest.amaxFailureUnsupportedDatatype/0 [ OK ] ExtOpTest/ExtOpAMaxUnsupportedDatatypeTest.amaxFailureUnsupportedDatatype/0 (0 ms) [----------] 1 test from ExtOpTest/ExtOpAMaxUnsupportedDatatypeTest (0 ms total)
[----------] 1 test from ExtOpTest/ExtOpAMaxWithScaleUnsupportedDatatypeTest [ RUN ] ExtOpTest/ExtOpAMaxWithScaleUnsupportedDatatypeTest.amaxWithScaleFailureUnsupportedDatatype/0 [ OK ] ExtOpTest/ExtOpAMaxWithScaleUnsupportedDatatypeTest.amaxWithScaleFailureUnsupportedDatatype/0 (0 ms) [----------] 1 test from ExtOpTest/ExtOpAMaxWithScaleUnsupportedDatatypeTest (0 ms total)
[----------] Global test environment tear-down [==========] 48206 tests from 13 test suites ran. (807119 ms total) [ PASSED ] 48206 tests. hipBLASLt version: 1000
command line: ./hipblaslt-test
HHS equivalent of BBS kernels converted , validated and merged.