DCNv2
DCNv2 copied to clipboard
Fixes for Pytorch 1.6 and CUDA 11.
I changed out the matrix * vector multiplication for matrix * matrix multiplication. This fixes the error "THFloatBlas_gemv is undefined" as well as the error "THCudaBlas_Sgemv is undefined"