
[Tracking Issue] Need support for GQA Attention in Relax

Open hamzaq5 opened this issue 4 months ago • 2 comments

Grouped-query attention (GQA) is critical for modern LLM compilation workflows, but Relax does not currently support it natively. Adding native GQA support to Relax would enable better performance and compatibility with transformer-based models exported from PyTorch, Hugging Face, or ONNX.

https://arxiv.org/abs/2305.13245
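
For anyone picking this up, here is a rough pure-Python sketch of the semantics being requested, as I understand them from the paper: several query heads share one key/value head. This is only a reference for the math, not Relax code and not a proposed API. The function name `gqa_attention`, the nested-list layout, and the omission of masking and batching are my own assumptions for illustration.

```python
import math


def softmax(xs):
    # Numerically stable softmax over a flat list of floats.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def gqa_attention(q, k, v):
    """Hypothetical reference for grouped-query attention (no mask, no batch).

    q:    [num_q_heads][seq_q][head_dim]
    k, v: [num_kv_heads][seq_kv][head_dim]
    Returns a list shaped [num_q_heads][seq_q][head_dim].
    """
    num_q_heads, num_kv_heads = len(q), len(k)
    if num_q_heads % num_kv_heads != 0:
        raise ValueError("num_q_heads must be a multiple of num_kv_heads")
    group_size = num_q_heads // num_kv_heads
    head_dim = len(q[0][0])
    scale = 1.0 / math.sqrt(head_dim)

    out = []
    for h in range(num_q_heads):
        # Query heads in the same group read from the same K/V head.
        kv = h // group_size
        head_out = []
        for q_vec in q[h]:
            scores = [
                scale * sum(a * b for a, b in zip(q_vec, k_vec))
                for k_vec in k[kv]
            ]
            weights = softmax(scores)
            head_out.append([
                sum(w * v_vec[d] for w, v_vec in zip(weights, v[kv]))
                for d in range(head_dim)
            ])
        out.append(head_out)
    return out
```

With `num_kv_heads == num_q_heads` this reduces to ordinary multi-head attention, and with `num_kv_heads == 1` it reduces to multi-query attention, so a native op would presumably need to cover the whole range in between.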

hamzaq5 avatar Aug 07 '25 20:08 hamzaq5

Can I be assigned to this, please, if it's open to community contributions?

hamzaqureshi5 avatar Aug 09 '25 18:08 hamzaqureshi5

@hamzaqureshi5 Absolutely, you can!

tlopex avatar Sep 26 '25 00:09 tlopex