oneflow icon indicating copy to clipboard operation
oneflow copied to clipboard

Dev linalg cross

Open mosout opened this issue 3 years ago • 23 comments

https://github.com/Oneflow-Inc/oneflow/issues/8898

mosout avatar Aug 22 '22 02:08 mosout

View latest API docs preview at: https://staging.oneflow.info/docs/Oneflow-Inc/oneflow/pr/8979/

github-actions[bot] avatar Sep 09 '22 08:09 github-actions[bot]

CI failed when running job: cpu-module. PR label automerge has been removed

github-actions[bot] avatar Sep 09 '22 08:09 github-actions[bot]

Speed stats:

github-actions[bot] avatar Sep 09 '22 08:09 github-actions[bot]

Static analysis with clang failed. PR label automerge has been removed

github-actions[bot] avatar Sep 09 '22 08:09 github-actions[bot]

CI failed when running job: cpu-module. PR label automerge has been removed

github-actions[bot] avatar Sep 16 '22 11:09 github-actions[bot]

Speed stats:

github-actions[bot] avatar Sep 16 '22 11:09 github-actions[bot]

Speed stats:

github-actions[bot] avatar Sep 21 '22 16:09 github-actions[bot]

https://staging.oneflow.info/docs/Oneflow-Inc/oneflow/pr/8979/generated/oneflow.linalg.cross.html

这里的文档生成好像有点问题,可以看看为啥

wyg1997 avatar Sep 22 '22 02:09 wyg1997

https://staging.oneflow.info/docs/Oneflow-Inc/oneflow/pr/8979/generated/oneflow.linalg.cross.html

这里的文档生成好像有点问题,可以看看为啥

这个问题已经修了 那个是生成的老的 等重新跑下ci就可以了

mosout avatar Sep 22 '22 02:09 mosout

Code got formatted by CI. Please request CI again if you still want to have this PR merged. If the PR is from a forked repo, please download the patch files from the GitHub Actions web page and apply them locally.

github-actions[bot] avatar Sep 22 '22 02:09 github-actions[bot]

Speed stats:

github-actions[bot] avatar Sep 22 '22 03:09 github-actions[bot]

Speed stats:
GPU Name: GeForce GTX 1080 









❌ OneFlow resnet50 time: 140.0ms (= 14001.3ms / 100, input_shape=[16, 3, 224, 224], ddp, world size=2)
PyTorch resnet50 time: 165.8ms (= 16582.1ms / 100, input_shape=[16, 3, 224, 224], ddp, world size=2)
✔️ Relative speed: 1.18 (= 165.8ms / 140.0ms)

OneFlow resnet50 time: 85.4ms (= 8544.2ms / 100, input_shape=[8, 3, 224, 224], ddp, world size=2)
PyTorch resnet50 time: 102.4ms (= 10236.2ms / 100, input_shape=[8, 3, 224, 224], ddp, world size=2)
✔️ Relative speed: 1.20 (= 102.4ms / 85.4ms)

OneFlow resnet50 time: 58.4ms (= 11675.5ms / 200, input_shape=[4, 3, 224, 224], ddp, world size=2)
PyTorch resnet50 time: 78.1ms (= 15623.4ms / 200, input_shape=[4, 3, 224, 224], ddp, world size=2)
✔️ Relative speed: 1.34 (= 78.1ms / 58.4ms)

OneFlow resnet50 time: 44.4ms (= 8881.8ms / 200, input_shape=[2, 3, 224, 224], ddp, world size=2)
PyTorch resnet50 time: 70.3ms (= 14065.1ms / 200, input_shape=[2, 3, 224, 224], ddp, world size=2)
✔️ Relative speed: 1.58 (= 70.3ms / 44.4ms)

OneFlow resnet50 time: 39.9ms (= 7982.8ms / 200, input_shape=[1, 3, 224, 224], ddp, world size=2)
PyTorch resnet50 time: 67.7ms (= 13534.3ms / 200, input_shape=[1, 3, 224, 224], ddp, world size=2)
✔️ Relative speed: 1.70 (= 67.7ms / 39.9ms)

github-actions[bot] avatar Sep 22 '22 12:09 github-actions[bot]

CI failed when running job: cpu-module. PR label automerge has been removed

github-actions[bot] avatar Sep 22 '22 12:09 github-actions[bot]

View latest API docs preview at: https://staging.oneflow.info/docs/Oneflow-Inc/oneflow/pr/8979/

github-actions[bot] avatar Sep 22 '22 12:09 github-actions[bot]

Speed stats:

github-actions[bot] avatar Sep 22 '22 12:09 github-actions[bot]

Speed stats:
GPU Name: GeForce GTX 1080 









❌ OneFlow resnet50 time: 139.8ms (= 13975.7ms / 100, input_shape=[16, 3, 224, 224], ddp, world size=2)
PyTorch resnet50 time: 161.0ms (= 16095.6ms / 100, input_shape=[16, 3, 224, 224], ddp, world size=2)
✔️ Relative speed: 1.15 (= 161.0ms / 139.8ms)

OneFlow resnet50 time: 86.0ms (= 8600.6ms / 100, input_shape=[8, 3, 224, 224], ddp, world size=2)
PyTorch resnet50 time: 102.6ms (= 10261.0ms / 100, input_shape=[8, 3, 224, 224], ddp, world size=2)
✔️ Relative speed: 1.19 (= 102.6ms / 86.0ms)

OneFlow resnet50 time: 58.6ms (= 11719.5ms / 200, input_shape=[4, 3, 224, 224], ddp, world size=2)
PyTorch resnet50 time: 89.2ms (= 17845.1ms / 200, input_shape=[4, 3, 224, 224], ddp, world size=2)
✔️ Relative speed: 1.52 (= 89.2ms / 58.6ms)

OneFlow resnet50 time: 44.7ms (= 8943.3ms / 200, input_shape=[2, 3, 224, 224], ddp, world size=2)
PyTorch resnet50 time: 69.7ms (= 13946.3ms / 200, input_shape=[2, 3, 224, 224], ddp, world size=2)
✔️ Relative speed: 1.56 (= 69.7ms / 44.7ms)

OneFlow resnet50 time: 41.5ms (= 8303.5ms / 200, input_shape=[1, 3, 224, 224], ddp, world size=2)
PyTorch resnet50 time: 76.9ms (= 15386.5ms / 200, input_shape=[1, 3, 224, 224], ddp, world size=2)
✔️ Relative speed: 1.85 (= 76.9ms / 41.5ms)

github-actions[bot] avatar Sep 22 '22 14:09 github-actions[bot]

Speed stats:
GPU Name: GeForce GTX 1080 









❌ OneFlow resnet50 time: 140.2ms (= 14019.6ms / 100, input_shape=[16, 3, 224, 224], ddp, world size=2)
PyTorch resnet50 time: 161.1ms (= 16110.9ms / 100, input_shape=[16, 3, 224, 224], ddp, world size=2)
✔️ Relative speed: 1.15 (= 161.1ms / 140.2ms)

OneFlow resnet50 time: 85.2ms (= 8521.8ms / 100, input_shape=[8, 3, 224, 224], ddp, world size=2)
PyTorch resnet50 time: 110.9ms (= 11087.5ms / 100, input_shape=[8, 3, 224, 224], ddp, world size=2)
✔️ Relative speed: 1.30 (= 110.9ms / 85.2ms)

OneFlow resnet50 time: 58.0ms (= 11594.5ms / 200, input_shape=[4, 3, 224, 224], ddp, world size=2)
PyTorch resnet50 time: 78.3ms (= 15659.3ms / 200, input_shape=[4, 3, 224, 224], ddp, world size=2)
✔️ Relative speed: 1.35 (= 78.3ms / 58.0ms)

OneFlow resnet50 time: 45.2ms (= 9046.2ms / 200, input_shape=[2, 3, 224, 224], ddp, world size=2)
PyTorch resnet50 time: 68.4ms (= 13686.9ms / 200, input_shape=[2, 3, 224, 224], ddp, world size=2)
✔️ Relative speed: 1.51 (= 68.4ms / 45.2ms)

OneFlow resnet50 time: 40.2ms (= 8047.8ms / 200, input_shape=[1, 3, 224, 224], ddp, world size=2)
PyTorch resnet50 time: 68.4ms (= 13677.0ms / 200, input_shape=[1, 3, 224, 224], ddp, world size=2)
✔️ Relative speed: 1.70 (= 68.4ms / 40.2ms)

github-actions[bot] avatar Sep 22 '22 15:09 github-actions[bot]

View latest API docs preview at: https://staging.oneflow.info/docs/Oneflow-Inc/oneflow/pr/8979/

github-actions[bot] avatar Sep 22 '22 15:09 github-actions[bot]

Speed stats:

github-actions[bot] avatar Sep 22 '22 15:09 github-actions[bot]

View latest API docs preview at: https://staging.oneflow.info/docs/Oneflow-Inc/oneflow/pr/8979/

github-actions[bot] avatar Sep 22 '22 16:09 github-actions[bot]

Speed stats:
GPU Name: NVIDIA GeForce GTX 1080 









❌ OneFlow resnet50 time: 149.4ms (= 14941.9ms / 100, input_shape=[16, 3, 224, 224], ddp, world size=2)
PyTorch resnet50 time: 172.1ms (= 17207.6ms / 100, input_shape=[16, 3, 224, 224], ddp, world size=2)
✔️ Relative speed: 1.15 (= 172.1ms / 149.4ms)

OneFlow resnet50 time: 96.0ms (= 9595.5ms / 100, input_shape=[8, 3, 224, 224], ddp, world size=2)
PyTorch resnet50 time: 113.9ms (= 11392.4ms / 100, input_shape=[8, 3, 224, 224], ddp, world size=2)
✔️ Relative speed: 1.19 (= 113.9ms / 96.0ms)

OneFlow resnet50 time: 70.1ms (= 14018.2ms / 200, input_shape=[4, 3, 224, 224], ddp, world size=2)
PyTorch resnet50 time: 87.7ms (= 17546.5ms / 200, input_shape=[4, 3, 224, 224], ddp, world size=2)
✔️ Relative speed: 1.25 (= 87.7ms / 70.1ms)

OneFlow resnet50 time: 60.0ms (= 12001.6ms / 200, input_shape=[2, 3, 224, 224], ddp, world size=2)
PyTorch resnet50 time: 75.9ms (= 15181.3ms / 200, input_shape=[2, 3, 224, 224], ddp, world size=2)
✔️ Relative speed: 1.26 (= 75.9ms / 60.0ms)

OneFlow resnet50 time: 55.2ms (= 11047.6ms / 200, input_shape=[1, 3, 224, 224], ddp, world size=2)
PyTorch resnet50 time: 70.6ms (= 14120.4ms / 200, input_shape=[1, 3, 224, 224], ddp, world size=2)
✔️ Relative speed: 1.28 (= 70.6ms / 55.2ms)

github-actions[bot] avatar Sep 22 '22 16:09 github-actions[bot]

View latest API docs preview at: https://staging.oneflow.info/docs/Oneflow-Inc/oneflow/pr/8979/

github-actions[bot] avatar Sep 22 '22 16:09 github-actions[bot]

Speed stats:
GPU Name: GeForce GTX 1080 









❌ OneFlow resnet50 time: 140.0ms (= 14004.5ms / 100, input_shape=[16, 3, 224, 224], ddp, world size=2)
PyTorch resnet50 time: 164.0ms (= 16398.1ms / 100, input_shape=[16, 3, 224, 224], ddp, world size=2)
✔️ Relative speed: 1.17 (= 164.0ms / 140.0ms)

OneFlow resnet50 time: 85.6ms (= 8559.4ms / 100, input_shape=[8, 3, 224, 224], ddp, world size=2)
PyTorch resnet50 time: 112.2ms (= 11224.8ms / 100, input_shape=[8, 3, 224, 224], ddp, world size=2)
✔️ Relative speed: 1.31 (= 112.2ms / 85.6ms)

OneFlow resnet50 time: 58.1ms (= 11621.8ms / 200, input_shape=[4, 3, 224, 224], ddp, world size=2)
PyTorch resnet50 time: 77.6ms (= 15528.2ms / 200, input_shape=[4, 3, 224, 224], ddp, world size=2)
✔️ Relative speed: 1.34 (= 77.6ms / 58.1ms)

OneFlow resnet50 time: 44.8ms (= 8953.0ms / 200, input_shape=[2, 3, 224, 224], ddp, world size=2)
PyTorch resnet50 time: 69.5ms (= 13897.2ms / 200, input_shape=[2, 3, 224, 224], ddp, world size=2)
✔️ Relative speed: 1.55 (= 69.5ms / 44.8ms)

OneFlow resnet50 time: 40.6ms (= 8125.6ms / 200, input_shape=[1, 3, 224, 224], ddp, world size=2)
PyTorch resnet50 time: 67.1ms (= 13411.4ms / 200, input_shape=[1, 3, 224, 224], ddp, world size=2)
✔️ Relative speed: 1.65 (= 67.1ms / 40.6ms)

github-actions[bot] avatar Sep 22 '22 16:09 github-actions[bot]

Speed stats:
GPU Name: GeForce GTX 1080 









❌ OneFlow resnet50 time: 140.1ms (= 14009.6ms / 100, input_shape=[16, 3, 224, 224], ddp, world size=2)
PyTorch resnet50 time: 161.3ms (= 16128.5ms / 100, input_shape=[16, 3, 224, 224], ddp, world size=2)
✔️ Relative speed: 1.15 (= 161.3ms / 140.1ms)

OneFlow resnet50 time: 86.0ms (= 8595.3ms / 100, input_shape=[8, 3, 224, 224], ddp, world size=2)
PyTorch resnet50 time: 104.1ms (= 10414.7ms / 100, input_shape=[8, 3, 224, 224], ddp, world size=2)
✔️ Relative speed: 1.21 (= 104.1ms / 86.0ms)

OneFlow resnet50 time: 58.0ms (= 11597.6ms / 200, input_shape=[4, 3, 224, 224], ddp, world size=2)
PyTorch resnet50 time: 78.7ms (= 15736.4ms / 200, input_shape=[4, 3, 224, 224], ddp, world size=2)
✔️ Relative speed: 1.36 (= 78.7ms / 58.0ms)

OneFlow resnet50 time: 45.1ms (= 9012.6ms / 200, input_shape=[2, 3, 224, 224], ddp, world size=2)
PyTorch resnet50 time: 72.0ms (= 14400.2ms / 200, input_shape=[2, 3, 224, 224], ddp, world size=2)
✔️ Relative speed: 1.60 (= 72.0ms / 45.1ms)

OneFlow resnet50 time: 39.9ms (= 7985.4ms / 200, input_shape=[1, 3, 224, 224], ddp, world size=2)
PyTorch resnet50 time: 76.8ms (= 15365.6ms / 200, input_shape=[1, 3, 224, 224], ddp, world size=2)
✔️ Relative speed: 1.92 (= 76.8ms / 39.9ms)

github-actions[bot] avatar Sep 23 '22 03:09 github-actions[bot]

View latest API docs preview at: https://staging.oneflow.info/docs/Oneflow-Inc/oneflow/pr/8979/

github-actions[bot] avatar Sep 23 '22 03:09 github-actions[bot]

CI failed when running job: cpu-module. PR label automerge has been removed

github-actions[bot] avatar Sep 23 '22 03:09 github-actions[bot]

View latest API docs preview at: https://staging.oneflow.info/docs/Oneflow-Inc/oneflow/pr/8979/

github-actions[bot] avatar Sep 25 '22 08:09 github-actions[bot]

Speed stats:
GPU Name: GeForce GTX 1080 









❌ OneFlow resnet50 time: 139.8ms (= 13981.3ms / 100, input_shape=[16, 3, 224, 224], ddp, world size=2)
PyTorch resnet50 time: 161.6ms (= 16155.5ms / 100, input_shape=[16, 3, 224, 224], ddp, world size=2)
✔️ Relative speed: 1.16 (= 161.6ms / 139.8ms)

OneFlow resnet50 time: 85.9ms (= 8586.2ms / 100, input_shape=[8, 3, 224, 224], ddp, world size=2)
PyTorch resnet50 time: 102.2ms (= 10219.4ms / 100, input_shape=[8, 3, 224, 224], ddp, world size=2)
✔️ Relative speed: 1.19 (= 102.2ms / 85.9ms)

OneFlow resnet50 time: 58.6ms (= 11721.0ms / 200, input_shape=[4, 3, 224, 224], ddp, world size=2)
PyTorch resnet50 time: 78.2ms (= 15641.2ms / 200, input_shape=[4, 3, 224, 224], ddp, world size=2)
✔️ Relative speed: 1.33 (= 78.2ms / 58.6ms)

OneFlow resnet50 time: 45.7ms (= 9139.6ms / 200, input_shape=[2, 3, 224, 224], ddp, world size=2)
PyTorch resnet50 time: 76.1ms (= 15225.3ms / 200, input_shape=[2, 3, 224, 224], ddp, world size=2)
✔️ Relative speed: 1.67 (= 76.1ms / 45.7ms)

OneFlow resnet50 time: 39.9ms (= 7979.0ms / 200, input_shape=[1, 3, 224, 224], ddp, world size=2)
PyTorch resnet50 time: 66.5ms (= 13297.3ms / 200, input_shape=[1, 3, 224, 224], ddp, world size=2)
✔️ Relative speed: 1.67 (= 66.5ms / 39.9ms)

github-actions[bot] avatar Sep 25 '22 08:09 github-actions[bot]

View latest API docs preview at: https://staging.oneflow.info/docs/Oneflow-Inc/oneflow/pr/8979/

github-actions[bot] avatar Sep 25 '22 09:09 github-actions[bot]

Speed stats:
GPU Name: GeForce GTX 1080 









❌ OneFlow resnet50 time: 140.0ms (= 14003.9ms / 100, input_shape=[16, 3, 224, 224], ddp, world size=2)
PyTorch resnet50 time: 162.7ms (= 16268.2ms / 100, input_shape=[16, 3, 224, 224], ddp, world size=2)
✔️ Relative speed: 1.16 (= 162.7ms / 140.0ms)

OneFlow resnet50 time: 86.2ms (= 8619.8ms / 100, input_shape=[8, 3, 224, 224], ddp, world size=2)
PyTorch resnet50 time: 102.8ms (= 10276.4ms / 100, input_shape=[8, 3, 224, 224], ddp, world size=2)
✔️ Relative speed: 1.19 (= 102.8ms / 86.2ms)

OneFlow resnet50 time: 58.4ms (= 11674.9ms / 200, input_shape=[4, 3, 224, 224], ddp, world size=2)
PyTorch resnet50 time: 89.2ms (= 17836.3ms / 200, input_shape=[4, 3, 224, 224], ddp, world size=2)
✔️ Relative speed: 1.53 (= 89.2ms / 58.4ms)

OneFlow resnet50 time: 45.1ms (= 9020.1ms / 200, input_shape=[2, 3, 224, 224], ddp, world size=2)
PyTorch resnet50 time: 72.5ms (= 14493.0ms / 200, input_shape=[2, 3, 224, 224], ddp, world size=2)
✔️ Relative speed: 1.61 (= 72.5ms / 45.1ms)

OneFlow resnet50 time: 40.2ms (= 8034.3ms / 200, input_shape=[1, 3, 224, 224], ddp, world size=2)
PyTorch resnet50 time: 68.8ms (= 13759.2ms / 200, input_shape=[1, 3, 224, 224], ddp, world size=2)
✔️ Relative speed: 1.71 (= 68.8ms / 40.2ms)

github-actions[bot] avatar Sep 25 '22 09:09 github-actions[bot]