alignment-handbook icon indicating copy to clipboard operation
alignment-handbook copied to clipboard

LoRA + FlashAttention2 speed up?

Open zhoumengbo opened this issue 2 years ago • 1 comments

When fine-tuning Mistral with LoRA, do you think FlashAttention2 helps in speeding up the process? If yes, how significant is the acceleration? Where is the primary acceleration achieved?

zhoumengbo avatar Nov 11 '23 07:11 zhoumengbo

Hi @zhoumengbo I don't recall if we benchmarked speed with FA2 and LoRA, but I do know that it's crucial in order to bring the vRAM usage down.

lewtun avatar Nov 13 '23 09:11 lewtun