Eric Hartford
Here is the full stack trace:

```
Traceback (most recent call last):
  File "/home/eric/git/qlora/qlora.py", line 845, in <module>
    train()
  File "/home/eric/git/qlora/qlora.py", line 807, in train
    train_result = trainer.train()
  File "/home/eric/miniconda3/envs/qlora/lib/python3.10/site-packages/transformers/trainer.py", line 1539, ...
```
I got around that problem by moving the patch code to an earlier point, before loading the model. But I hit another error:

```
RuntimeError: FlashAttention only support fp16 and bf16...
```
Here is the full stack trace:

```
Traceback (most recent call last):
  File "/home/eric/git/qlora/qlora.py", line 845, in <module>
    train()
  File "/home/eric/git/qlora/qlora.py", line 807, in train
    train_result = trainer.train()
  File "/home/eric/miniconda3/envs/qlora/lib/python3.10/site-packages/transformers/trainer.py", line 1539, ...
```
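For reference, a minimal sketch of the patch-before-load ordering described above. The import and model path are assumptions (a FastChat-style `llama_flash_attn_monkey_patch` module and a placeholder checkpoint), not the exact code from this repo:

```python
import torch
from transformers import AutoModelForCausalLM

# Hypothetical import -- stands in for whatever flash-attention
# monkey-patch module is in use.
from llama_flash_attn_monkey_patch import replace_llama_attn_with_flash_attn

# 1. Patch first, so the class-level replacement is in place before the
#    model (and anything it sets up at construction time) is created.
replace_llama_attn_with_flash_attn()

# 2. Only then load the model.
model = AutoModelForCausalLM.from_pretrained(
    "path/to/llama-checkpoint",  # placeholder
    torch_dtype=torch.bfloat16,
)
```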
Could it be that qlora patches it again after I patch it?
OK, trying.
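One way to test that: compare the forward actually installed on the attention class after everything has loaded against the one the patch installed. A hypothetical check, assuming the patch module exposes its replacement as a module-level `forward` (as FastChat-style patches do):

```python
from transformers.models.llama.modeling_llama import LlamaAttention
import llama_flash_attn_monkey_patch as patch  # assumed patch module

# If qlora (or anything else) assigned a new forward after our patch ran,
# these are no longer the same function object.
if LlamaAttention.forward is not patch.forward:
    print("LlamaAttention.forward was replaced again after the patch!")
else:
    print("Patch is still in effect.")
```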
```
RuntimeError: FlashAttention only support fp16 and bf16 data type
```
It seems FlashAttention itself may need to be modified to support this.
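An alternative to modifying FlashAttention itself is to cast at the call site: with qlora, parts of the forward pass run in fp32 (layer norms, for example), so the q/k/v tensors reaching the kernel may not be fp16/bf16. A hedged sketch of a cast-down/cast-back wrapper, assuming the flash-attn 2.x `flash_attn_func` API (the wrapper name is illustrative):

```python
import torch
from flash_attn import flash_attn_func

def flash_attention(q, k, v, causal=True):
    """q, k, v: (batch, seqlen, nheads, headdim) tensors."""
    input_dtype = q.dtype
    if input_dtype not in (torch.float16, torch.bfloat16):
        # FlashAttention kernels only accept fp16/bf16, so cast down...
        q, k, v = (t.to(torch.bfloat16) for t in (q, k, v))
    out = flash_attn_func(q, k, v, causal=causal)
    # ...and restore the caller's dtype on the way out.
    return out.to(input_dtype)
```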
@tridao is it possible to merge this to support ROCm? https://github.com/ROCmSoftwarePlatform/flash-attention
I doubt they would disapprove of merging; it seems to be just a communication gap. I will reach out.
I have MI100s and would love to be able to use them.