Tri Dao

Results 435 comments of Tri Dao
trafficstars

Yea you can just download the wheel compiled with cuda 12.3. Should be compatible.

Thanks! Is the formatting by black using line length of [100](https://github.com/Dao-AILab/flash-attention/blob/main/flash_attn/pyproject.toml)?

Sorry I've just been busy. Let me take a look tomorrow.

Seems like a Triton error. You might have better luck searching their repo issues.

For those with AMD devices can you help test this PR?

softcapping is not supported yet in the backward pass