cutlass icon indicating copy to clipboard operation
cutlass copied to clipboard

[QST] GEMM Epilogue Fusion: Element-wise Ops and Two-Tensor Element-wise Multiplication

Open HanGuo97 opened this issue 8 months ago • 8 comments

What is your question? Hi, I'd like to compute the following

D = f( matmul(A, B) ) * C

where f is an element-wise activation function, and C is a tensor of the same shape as the output D. In CUTLASS, it seems like only per-dimension scaling is supported in epilogue fusion, but not element-wise scaling between two tensors. In light of this, is such fusion profitable? If so, what are the best ways to implement this?

Thanks in advance for your time!

HanGuo97 avatar May 30 '24 18:05 HanGuo97