dct_cuda
dct_cuda copied to clipboard
Hi, I'm quite new to using Torch with CUDA kernels. I'm struggling to build it on Colab, do you have any advice or resources you can point me to? Thank...
## Generality * The 2d implementation should also support 1d input. * What if input is not a power of 2 ## Specificity * Optimization for cases where `M =...
* multiplying scale in `precomputeExpk` * zero paddings to avoid branch divergence * in-place or out-of-place cufft, especially in idct * number of threads in idct `M/2 * N/2` or...