Xiaohan Ding

Results 1 issues of Xiaohan Ding

Sometimes when the tensor format changes after this conv (e.g., NCHW -> NHWC for layer normalization), calling backward will raise an "input must be contiguous" error. Making the grad contiguous...