TransformerEngine
[Feature Request] Is there a roadmap for supporting FP8 attention computation?
Currently, only FP16/BF16 are supported in the FusedAttention class.
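For context, here is a minimal sketch of the current behavior, assuming the public `transformer_engine.pytorch.DotProductAttention` API (the module that dispatches to backends such as FusedAttention) and illustrative tensor shapes: even inside an `fp8_autocast` region, the query/key/value tensors must be FP16/BF16, so the attention math itself is not performed in FP8.

```python
import torch
import transformer_engine.pytorch as te
from transformer_engine.common.recipe import DelayedScaling, Format

# Illustrative shapes, chosen arbitrarily for this sketch.
seq_len, batch, heads, head_dim = 128, 2, 16, 64

attn = te.DotProductAttention(num_attention_heads=heads, kv_channels=head_dim)

# Inputs must be FP16/BF16; there is no FP8 code path for the attention kernel.
# Default qkv_format is "sbhd": (seq, batch, heads, head_dim).
q = torch.randn(seq_len, batch, heads, head_dim, dtype=torch.bfloat16, device="cuda")
k = torch.randn_like(q)
v = torch.randn_like(q)

fp8_recipe = DelayedScaling(fp8_format=Format.HYBRID)

# fp8_autocast enables FP8 for the GEMM-based layers (e.g. te.Linear),
# but the fused attention computation here still runs in BF16.
with te.fp8_autocast(enabled=True, fp8_recipe=fp8_recipe):
    out = attn(q, k, v)

print(out.dtype)  # torch.bfloat16 -- attention compute stays in BF16
```

It would be great to know whether FP8 attention (e.g. FP8 inputs/outputs for the fused kernel) is planned, and roughly on what timeline.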