[FMHA] add support for later CUDA (8.x)
add support for later minor dprops 8.0+ for CUDA DEVICES such as A40
@jqueguiner Thanks for the PR but I'm afraid we need to relax the condition in backward path as well in https://github.com/NVIDIA/apex/blob/f7421555c3d2ff01eed0e7c0c4321f3e4dd58fc6/apex/contrib/csrc/fmha/fmha_api.cpp#L182-L191.
Thank you for your patience in my delayed response.
@crcrpar : thanks for the feedback; should be fixed now.
thanks
Hi @crcrpar any update regarding this matter ?
I found that this FMHA failed when I ran https://github.com/NVIDIA/apex/blob/master/apex/contrib/test/fmha/test_fmha.py with this PR on A40 because the kernel might violate some limitation unfortunately...
cc'ing @yjk21 for visibility