TransformerEngine icon indicating copy to clipboard operation
TransformerEngine copied to clipboard

[Feature Request] Any roadmap for supporting FP8 attention calculation?

Open MoFHeka opened this issue 1 year ago • 1 comments

There is only FP16/BF16 being supported in class FusedAttention.

MoFHeka avatar Aug 10 '23 12:08 MoFHeka