flash-attention
Is FlashAttention supported on NVIDIA DRIVE Orin as a replacement for multi-head attention?
Hi, I have a question: is FlashAttention supported on NVIDIA DRIVE Orin as a replacement for multi-head attention? I want to deploy it on NVIDIA DRIVE Orin.
Probably. You can search the GitHub issues to check.