Matthew Nicely
cuDNN SDPA doesn't support Turing GPUs.
Thanks for the request. This is something we've been thinking about but don't have the bandwidth to work on at the moment. Could you tell me more about your use...
https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#programmatic-dependent-launch-and-synchronization
https://github.com/NVIDIA/cutlass/issues/2302#issuecomment-2886934868
@hg0428 can you offer insight into why you need FA FP4 support? Have you tested the accuracy?
@tridao we're working on it now! Will have an ETA soon.
@IonThruster
What other requirements would you have for FP32/FP64, such as head dim, seq_len, etc.?
@Anerudhan for hardware restrictions. @JayL323 I'll look into the example. Do you have a particular configuration in mind?