He Ma

Results: 2 repositories owned by He Ma

cublasgemm-benchmark

28 Stars, 16 Forks

Code for benchmarking GPU performance using cublasSgemm and cublasHgemm

cublasHgemm-P100

34 Stars, 13 Forks

Code for testing native float16 matrix multiplication performance on Tesla P100 and V100 GPUs using cublasHgemm