[CUBLAS][FP8] Enable fusing astype operation for matmul multiply pattern

Open • valmat07 opened this issue 1 year ago • 1 comment

This PR fuses the astype operation into the matmul pattern for cuBLAS. The change is needed to improve FP8 performance.
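To illustrate the idea, here is a minimal toy sketch of folding a trailing cast into a matmul node. It is not TVM's pattern API or the code in this PR: the `Var`, `MatMul`, and `AsType` classes, the `fuse_astype_into_matmul` helper, and the dtype strings are all hypothetical names made up for the example.

```python
from dataclasses import dataclass
from typing import Optional, Union


@dataclass(frozen=True)
class Var:
    name: str
    dtype: str  # illustrative dtype label, not a real TVM dtype string


@dataclass(frozen=True)
class MatMul:
    lhs: "Expr"
    rhs: "Expr"
    out_dtype: Optional[str] = None  # None means "no explicit output dtype"


@dataclass(frozen=True)
class AsType:
    value: "Expr"
    dtype: str


Expr = Union[Var, MatMul, AsType]


def fuse_astype_into_matmul(expr: Expr) -> Expr:
    """Rewrite astype(matmul(a, b), dt) into matmul(a, b, out_dtype=dt)."""
    if isinstance(expr, AsType):
        inner = fuse_astype_into_matmul(expr.value)
        # Only fold the cast when the matmul has no output dtype yet.
        if isinstance(inner, MatMul) and inner.out_dtype is None:
            return MatMul(inner.lhs, inner.rhs, out_dtype=expr.dtype)
        return AsType(inner, expr.dtype)
    if isinstance(expr, MatMul):
        # Recurse into operands so nested casts are handled too.
        return MatMul(
            fuse_astype_into_matmul(expr.lhs),
            fuse_astype_into_matmul(expr.rhs),
            expr.out_dtype,
        )
    return expr


a = Var("a", "fp8")
b = Var("b", "fp8")
unfused = AsType(MatMul(a, b), "fp16")   # two ops: matmul, then cast
fused = fuse_astype_into_matmul(unfused)  # one op: matmul with out_dtype
```

In this toy model the fused node stands for a single library call that writes its result directly in the target dtype, so the separate cast kernel and the intermediate buffer it reads from are no longer needed.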

valmat07, May 17 '24 12:05

Do we need to update the cuBLAS codegen or runtime to support the cast?

vinx13, May 30 '24 23:05