luminal
luminal copied to clipboard
FlashAttention
End goal is to automatically discover flash attention, but lots of work still needs to be done on an IR. For now lets hand code a kernel and find-and-replace it