He Ma
Results
2
repositories owned by
He Ma
cublasgemm-benchmark
28
Stars
16
Forks
Watchers
code for benchmarking GPU performance based on cublasSgemm and cublasHgemm
cublasHgemm-P100
34
Stars
13
Forks
Watchers
Code for testing the native float16 matrix multiplication performance on Tesla P100 and V100 GPU based on cublasHgemm