
Is replacing MultiheadAttention with FlashAttention supported on NVIDIA DRIVE ORIN?

Open wutheringcoo opened this issue 1 year ago • 1 comments

Hi, I have a question: is replacing MultiheadAttention with FlashAttention supported on NVIDIA DRIVE ORIN? I want to deploy a model that uses it on NVIDIA DRIVE ORIN.


wutheringcoo avatar Aug 30 '24 11:08 wutheringcoo

Probably. You can search the GitHub issues to check.

tridao avatar Sep 05 '24 07:09 tridao