Support float8 gemm_fused of cutlass

Open Wangzheee opened this issue 8 months ago • 3 comments

PR Category

Performance Optimization

PR Types

New features

Description

pcard-71500

  1. Add a CUTLASS kernel for fp8_fp8_half_gemm_fused (fuses GEMM + bias + scale + activation)
  2. Add the API and phi kernel for fp8_fp8_fp8_dual_gemm_fused (fuses GEMM + GEMM + SwiGLU)
  3. Add a CUTLASS kernel for fp8_fp8_fp8_dual_gemm_fused
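
For reviewers, here is a rough reference of what the two fused ops are expected to compute, written in plain Python on nested lists. This is only a sketch and not the kernel code in this PR. The function names (`gemm_fused_reference`, `dual_gemm_swiglu_reference`), the argument layout, the order in which scale and bias are applied, and the choice of which GEMM result passes through SiLU are all assumptions made for illustration. It uses ordinary Python floats and does not emulate fp8 storage or rounding.

```python
import math


def matmul(a, b):
    """Plain (M x K) @ (K x N) product on nested lists."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]


def relu(v):
    return max(v, 0.0)


def silu(v):
    # SiLU (swish): v * sigmoid(v)
    return v / (1.0 + math.exp(-v))


def gemm_fused_reference(a, b, bias, scale=1.0, act=relu):
    """Assumed semantics: act(scale * (a @ b) + bias), bias broadcast per column."""
    acc = matmul(a, b)
    return [[act(scale * v + bias[j]) for j, v in enumerate(row)]
            for row in acc]


def dual_gemm_swiglu_reference(a, b0, b1, scale0=1.0, scale1=1.0):
    """Assumed semantics: silu(scale0 * (a @ b0)) * (scale1 * (a @ b1))."""
    gate = matmul(a, b0)  # first GEMM, passed through SiLU (assumption)
    up = matmul(a, b1)    # second GEMM, used as the multiplier
    return [[silu(scale0 * g) * (scale1 * u) for g, u in zip(g_row, u_row)]
            for g_row, u_row in zip(gate, up)]
```

A reference like this can be used to compare against the fused kernel outputs within an fp8-appropriate tolerance; the point of fusing is that the epilogue (bias, scale, activation, or the SwiGLU multiply) runs without writing the intermediate GEMM results back to global memory.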

Wangzheee, Jun 05 '24 14:06