sys_reading
FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness
https://github.com/Dao-AILab/flash-attention