graph support flow.autograd.grad

Open hjchen2 opened this issue 2 years ago • 8 comments

hjchen2 avatar Sep 23 '22 03:09 hjchen2

Speed stats:

github-actions[bot] avatar Sep 23 '22 07:09 github-actions[bot]

CI failed when running job: cpu-module. PR label automerge has been removed

github-actions[bot] avatar Sep 23 '22 08:09 github-actions[bot]

Speed stats:

github-actions[bot] avatar Sep 23 '22 08:09 github-actions[bot]

CI failed when running job: cpu-module. PR label automerge has been removed

github-actions[bot] avatar Sep 23 '22 10:09 github-actions[bot]

Speed stats:

github-actions[bot] avatar Sep 23 '22 10:09 github-actions[bot]

Speed stats:
GPU Name: GeForce GTX 1080 

❌ OneFlow resnet50 time: 140.1ms (= 14012.2ms / 100, input_shape=[16, 3, 224, 224], ddp, world size=2)
PyTorch resnet50 time: 160.6ms (= 16060.0ms / 100, input_shape=[16, 3, 224, 224], ddp, world size=2)
✔️ Relative speed: 1.15 (= 160.6ms / 140.1ms)

OneFlow resnet50 time: 85.2ms (= 8524.3ms / 100, input_shape=[8, 3, 224, 224], ddp, world size=2)
PyTorch resnet50 time: 102.2ms (= 10217.8ms / 100, input_shape=[8, 3, 224, 224], ddp, world size=2)
✔️ Relative speed: 1.20 (= 102.2ms / 85.2ms)

OneFlow resnet50 time: 58.2ms (= 11633.0ms / 200, input_shape=[4, 3, 224, 224], ddp, world size=2)
PyTorch resnet50 time: 78.0ms (= 15606.0ms / 200, input_shape=[4, 3, 224, 224], ddp, world size=2)
✔️ Relative speed: 1.34 (= 78.0ms / 58.2ms)

OneFlow resnet50 time: 45.0ms (= 8998.3ms / 200, input_shape=[2, 3, 224, 224], ddp, world size=2)
PyTorch resnet50 time: 78.9ms (= 15776.5ms / 200, input_shape=[2, 3, 224, 224], ddp, world size=2)
✔️ Relative speed: 1.75 (= 78.9ms / 45.0ms)

OneFlow resnet50 time: 40.7ms (= 8147.7ms / 200, input_shape=[1, 3, 224, 224], ddp, world size=2)
PyTorch resnet50 time: 65.7ms (= 13149.5ms / 200, input_shape=[1, 3, 224, 224], ddp, world size=2)
✔️ Relative speed: 1.61 (= 65.7ms / 40.7ms)
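
The arithmetic behind each entry above can be reproduced with a short sketch. It assumes, from the figures shown in the report, that the per-iteration time is the total time divided by the iteration count, and that relative speed is the PyTorch per-iteration time divided by the OneFlow per-iteration time, so a value above 1 means OneFlow was faster. The function names `per_iter_ms` and `relative_speed` are hypothetical and are not taken from the OneFlow CI scripts.

```python
def per_iter_ms(total_ms: float, iters: int) -> float:
    """Average time per iteration, rounded to 0.1 ms as in the report."""
    return round(total_ms / iters, 1)


def relative_speed(pytorch_ms: float, oneflow_ms: float) -> float:
    """PyTorch time over OneFlow time; > 1 means OneFlow was faster."""
    return round(pytorch_ms / oneflow_ms, 2)


# (batch size, OneFlow total ms, PyTorch total ms, iterations),
# copied from the report above.
runs = [
    (16, 14012.2, 16060.0, 100),
    (8, 8524.3, 10217.8, 100),
    (4, 11633.0, 15606.0, 200),
    (2, 8998.3, 15776.5, 200),
    (1, 8147.7, 13149.5, 200),
]

for batch, oneflow_total, pytorch_total, iters in runs:
    oneflow_ms = per_iter_ms(oneflow_total, iters)
    pytorch_ms = per_iter_ms(pytorch_total, iters)
    speed = relative_speed(pytorch_ms, oneflow_ms)
    print(f"batch={batch}: OneFlow {oneflow_ms}ms, "
          f"PyTorch {pytorch_ms}ms, relative speed {speed:.2f}")
```

Under these assumptions the sketch reproduces the reported relative speeds of 1.15, 1.20, 1.34, 1.75 and 1.61 for batch sizes 16, 8, 4, 2 and 1.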

github-actions[bot] avatar Sep 28 '22 05:09 github-actions[bot]

View latest API docs preview at: https://staging.oneflow.info/docs/Oneflow-Inc/oneflow/pr/9136/

github-actions[bot] avatar Sep 28 '22 05:09 github-actions[bot]

CI failed when running job: cuda-module. PR label automerge has been removed

github-actions[bot] avatar Sep 28 '22 05:09 github-actions[bot]

Speed stats:
GPU Name: GeForce GTX 1080 

❌ OneFlow resnet50 time: 140.2ms (= 14023.1ms / 100, input_shape=[16, 3, 224, 224], ddp, world size=2)
PyTorch resnet50 time: 164.2ms (= 16424.6ms / 100, input_shape=[16, 3, 224, 224], ddp, world size=2)
✔️ Relative speed: 1.17 (= 164.2ms / 140.2ms)

OneFlow resnet50 time: 86.9ms (= 8694.0ms / 100, input_shape=[8, 3, 224, 224], ddp, world size=2)
PyTorch resnet50 time: 104.6ms (= 10460.7ms / 100, input_shape=[8, 3, 224, 224], ddp, world size=2)
✔️ Relative speed: 1.20 (= 104.6ms / 86.9ms)

OneFlow resnet50 time: 59.0ms (= 11794.0ms / 200, input_shape=[4, 3, 224, 224], ddp, world size=2)
PyTorch resnet50 time: 79.7ms (= 15938.9ms / 200, input_shape=[4, 3, 224, 224], ddp, world size=2)
✔️ Relative speed: 1.35 (= 79.7ms / 59.0ms)

OneFlow resnet50 time: 45.8ms (= 9163.1ms / 200, input_shape=[2, 3, 224, 224], ddp, world size=2)
PyTorch resnet50 time: 65.4ms (= 13074.0ms / 200, input_shape=[2, 3, 224, 224], ddp, world size=2)
✔️ Relative speed: 1.43 (= 65.4ms / 45.8ms)

OneFlow resnet50 time: 40.9ms (= 8177.6ms / 200, input_shape=[1, 3, 224, 224], ddp, world size=2)
PyTorch resnet50 time: 70.8ms (= 14158.4ms / 200, input_shape=[1, 3, 224, 224], ddp, world size=2)
✔️ Relative speed: 1.73 (= 70.8ms / 40.9ms)

github-actions[bot] avatar Feb 01 '23 15:02 github-actions[bot]