CUDALibrarySamples
Which is faster, cublas or cublaslt, when multiplying float, half, and int8 matrices?
Hi, which is faster, cublas or cublaslt, when multiplying float, half, and int8 matrices?
Hi @dingjingzhen, are you asking whether cublas is faster than cublaslt (or vice versa) for those three precisions, or are you asking which precision has the highest throughput?