mish-cuda
mish-cuda copied to clipboard
Performance gain vs normal way implementation
I'm not familiar with CUDA programming. Could you explain a little bit about the key factors in this implementation that brings performance gain? Thanks a lot!
@anhle-uet The performance in terms of generalization capability of the network is mirrored for both the normal implementation and the CUDA implementation, however, the CUDA implementation is much faster.