Matthew Nicely

Results 113 comments of Matthew Nicely

cuDNN SDPA doesn't support Turing GPUs.

Thanks for the request. This is something we've been thinking about but don't have the bandwidth to work on at the moment. Could you tell me more about your use...

https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#programmatic-dependent-launch-and-synchronization

https://github.com/NVIDIA/cutlass/issues/2302#issuecomment-2886934868

@hg0428 can you offer insight into why you need FA FP4 support? Have you tested the accuracy?

@tridao were working on it now! Will have an ETA soon

What other requirements would you have for FP32/FP64, such as head dim, seq_len, etc...

@Anerudhan for hardware restrictions. @JayL323 I'lol look into the example. Do you have particular configuration in mind?