cutlass
cutlass copied to clipboard
Update depthwise_conv2d_implicit_gemm.py
Sometimes when the tensor format changes after this conv (e.g., NCHW -> NHWC for layer normalization), calling backward will raise an "input must be contiguous" error. Making the grad contiguous in advance simply solves it.