Ma Mingfei comments

Results 93 comments of


CPU performance update

@jcjohnson @renato2099 We have optimized Torch performance on the CPU platform. The CPU optimizations are provided by mklnn and mkltorch; to install Torch with mklnn and mkltorch, refer to https://github.com/intel/torch. Currently we...

add channels last support for slow_conv_transpose2d

This patch enables `channels last` support for THNN's native transposed conv2d implementation, so that `convTranspose2d` behaves identically with and without mkldnn on the CPU side.
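
The behavior described above can be checked with a small script. This is a minimal sketch, not the test from the patch: the helper name `run_conv_transpose`, the layer sizes, and the tolerance are assumptions chosen for illustration. It runs the same `ConvTranspose2d` on a contiguous input and on a channels last input, with mkldnn disabled and enabled through `torch.backends.mkldnn.flags`, and compares the values.

```python
import torch
import torch.nn as nn


def run_conv_transpose(x, module, use_mkldnn):
    """Hypothetical helper: run `module` on `x` with mkldnn toggled on or off."""
    with torch.backends.mkldnn.flags(enabled=use_mkldnn):
        with torch.no_grad():
            return module(x)


torch.manual_seed(0)
# Illustrative sizes only; they are not taken from the patch.
module = nn.ConvTranspose2d(8, 4, kernel_size=3, stride=2, padding=1).eval()
x = torch.randn(2, 8, 5, 5)                      # contiguous (NCHW) input
x_cl = x.to(memory_format=torch.channels_last)   # same values, NHWC strides

ref = run_conv_transpose(x, module, use_mkldnn=False)
out_native = run_conv_transpose(x_cl, module, use_mkldnn=False)
out_mkldnn = run_conv_transpose(x_cl, module, use_mkldnn=True)

# The values should agree regardless of memory format or backend.
same_values = (
    torch.allclose(ref, out_native, atol=1e-5)
    and torch.allclose(ref, out_mkldnn, atol=1e-5)
)
print("values match:", same_values)

# Whether the output keeps the channels last layout depends on the
# PyTorch build, so it is only reported here, not asserted.
for name, out in [("native", out_native), ("mkldnn", out_mkldnn)]:
    print(name, "channels last output:",
          out.is_contiguous(memory_format=torch.channels_last))
```

On a build without mkldnn, both calls presumably fall back to the native path, so the comparison still runs but no longer exercises two backends.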

add channels last support for slow_conv_transpose2d

@VitalyFedyunin, both #70897 and #74023 depend on #77060 (this was reverted due to test case failures; I have fixed it, but it requires an upgrade of `ideep`, so I marked...

optimize ConvTransposedND with mkldnn float32 and bfloat16 on CPU

Reland #58348. **Need to upgrade ideep first to modify the memory format propagation logic.** Keeping it under draft for the moment. The original error from https://hud.pytorch.org/pytorch/pytorch/commit/479e0d64e607d48fc61a958caf6aeaf165e3d45d is due to misjudging a...

optimize ConvTransposedND with mkldnn float32 and bfloat16 on CPU

> Could you clear the OSS CI? @mingfeima

The thing is, our oneDNN-to-PyTorch integration logic is mostly implemented in `ideep`, which is a third-party repo from PyTorch's point of view. We will...

optimize ConvTransposedND with mkldnn float32 and bfloat16 on CPU

Update: the old failures are fixed by the ideep upgrade. Also lifting the restrictions from `torch/testing/_internal/common_modules.py`, since `ConvTranspose2d` has channels last support now; otherwise `python test_modules.py TestModuleCPU.test_memory_format_nn_ConvTranspose2d_cpu_float32` would fail.
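
For context on what a memory format check is about: channels last keeps the logical NCHW shape but orders the strides so that the channel dimension is innermost. The sketch below uses plain Python and hypothetical helper names (`contiguous_strides`, `channels_last_strides`) to compute both stride layouts for a 4-D shape; it is an illustration of the layout, not code from the PyTorch test suite.

```python
def contiguous_strides(shape):
    """Strides (in elements) of a contiguous NCHW tensor with this shape."""
    n, c, h, w = shape
    return (c * h * w, h * w, w, 1)


def channels_last_strides(shape):
    """Strides (in elements) of the same NCHW shape stored as NHWC."""
    n, c, h, w = shape
    return (h * w * c, 1, w * c, c)


shape = (2, 4, 9, 9)  # illustrative N, C, H, W
print("contiguous:   ", contiguous_strides(shape))
print("channels last:", channels_last_strides(shape))
```

In channels last, stepping along the channel dimension moves by one element, while stepping along width moves by `C` elements.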

V2 Performance Signal Detected by TorchBench CI on '1.13.0.dev20220811+cu113'
