flash-attention
[Rotary] more varlen rotary function implement
- add varlen support to the MHA layer
- add varlen support to ApplyRotaryEmbQKV_ / ApplyRotaryEmbKV_ (with test code)
- fix some code formatting and spelling errors
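For readers unfamiliar with the varlen layout, here is a minimal pure-Python sketch of what "varlen rotary" means. It is not the code in this PR. It assumes sequences are packed along one token axis and delimited by a `cu_seqlens` list of cumulative lengths, and that the rotation pairs element `i` with element `i + dim/2` (the non-interleaved style). The helper names `varlen_positions` and `apply_rotary_varlen` are hypothetical and exist only for this illustration.

```python
import math


def varlen_positions(cu_seqlens):
    """Return one position per packed token, restarting at 0 for each sequence.

    cu_seqlens is assumed to hold cumulative sequence lengths, e.g. [0, 3, 5]
    for two packed sequences of lengths 3 and 2.
    """
    positions = []
    for start, end in zip(cu_seqlens[:-1], cu_seqlens[1:]):
        positions.extend(range(end - start))
    return positions


def apply_rotary_varlen(x, cu_seqlens, base=10000.0):
    """Rotate each packed token vector by its position within its own sequence.

    x is a list of token vectors (total_tokens x dim) with an even dim.
    Hypothetical helper for illustration, not a flash-attention API.
    """
    dim = len(x[0])
    half = dim // 2
    # One rotation frequency per (i, i + half) pair.
    inv_freq = [base ** (-2.0 * i / dim) for i in range(half)]
    out = []
    for vec, pos in zip(x, varlen_positions(cu_seqlens)):
        rotated = list(vec)
        for i, freq in enumerate(inv_freq):
            c, s = math.cos(pos * freq), math.sin(pos * freq)
            x1, x2 = vec[i], vec[i + half]
            rotated[i] = x1 * c - x2 * s
            rotated[i + half] = x1 * s + x2 * c
        out.append(rotated)
    return out


# Two sequences (lengths 3 and 2) packed into 5 tokens.
tokens = [[1.0, 2.0, 3.0, 4.0] for _ in range(5)]
rotated_tokens = apply_rotary_varlen(tokens, [0, 3, 5])
```

The point of the sketch is that the first token of the second sequence is rotated with position 0 rather than position 3, which is what separates the varlen path from applying rotary to one long padded batch.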
Thanks! Is the formatting by black using line length of 100?
Sorry, my bad. I was formatting with black using the default settings. The formatting looks fine now.
Are there any plans to merge this soon? This feature would be really useful.
@tridao Hi~ Sorry to disturb you. Are there still any problems? We are looking forward to this PR being merged into main~
Sorry I've just been busy. Let me take a look tomorrow.
@tridao Sorry to disturb you again. It would be greatly appreciated if this PR could be merged. Thank you again for your time.
Hi, I'm curious about the status of this PR. This would be really useful to have merged!
+1, would be really useful.
@GGGGGGXY am I right in thinking this won't currently work for cross attention? How hard would it be to add support for this?
Hi, any progress with this PR?