[Pytorch] There are 7 Recurrently Failing Jobs on pytorch/pytorch nightly

Open github-actions[bot] opened this issue 2 years ago • 707 comments

Within the last 50 commits, there are the following failures on the main branch of pytorch:

Please review the errors and revert if needed.
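The follow-up comments on this issue report jobs that "started failing" or "stopped failing". As a rough illustration only (this is not the bot's actual code, and the function name `diff_failing_jobs` is hypothetical), such transitions could be derived by comparing the set of failing jobs from the previous check with the current one:

```python
def diff_failing_jobs(previous, current):
    """Compare two snapshots of failing job names.

    Hypothetical helper, not taken from the bot: returns the jobs that are
    newly failing and the jobs that are no longer failing, each sorted.
    """
    previous, current = set(previous), set(current)
    started = sorted(current - previous)   # failing now, not before
    stopped = sorted(previous - current)   # failing before, not now
    return started, stopped


if __name__ == "__main__":
    # Job names copied from the comments in this thread; the snapshots
    # themselves are made up for the example.
    before = {
        "linux-binary-conda / conda-py3_10-cpu-test / test",
        "linux-binary-conda / conda-py3_11-cpu-test / test",
    }
    after = {
        "linux-binary-conda / conda-py3_10-cpu-test / test",
        "trunk / win-vs2019-cuda11.7-py3 / test (force_on_cpu, 1, 1, windows.4xlarge)",
    }
    started, stopped = diff_failing_jobs(before, after)
    print("These jobs started failing:", started)
    print("These jobs stopped failing:", stopped)
```

A job that fails in both snapshots appears in neither list, which matches the thread's pattern of reporting only changes.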

github-actions[bot] • Mar 13 '23 07:03

These jobs started failing:

  • pull / linux-bionic-py3.11-clang9 / test (crossref, 1, 2, linux.2xlarge)
  • pull / linux-bionic-py3.11-clang9 / test (crossref, 2, 2, linux.2xlarge)
  • pull / linux-bionic-py3.11-clang9 / test (default, 1, 2, linux.2xlarge)
  • pull / linux-bionic-py3.11-clang9 / test (default, 2, 2, linux.2xlarge)
  • pull / linux-bionic-py3.11-clang9 / test (dynamo, 1, 2, linux.2xlarge)
  • pull / linux-bionic-py3.11-clang9 / test (dynamo, 2, 2, linux.2xlarge)
  • pull / linux-bionic-py3.11-clang9 / test (functorch, 1, 1, linux.2xlarge)
  • pull / linux-bionic-py3.8-clang9 / test (crossref, 1, 2, linux.2xlarge)
  • pull / linux-bionic-py3.8-clang9 / test (crossref, 2, 2, linux.2xlarge)
  • pull / linux-bionic-py3.8-clang9 / test (default, 1, 2, linux.2xlarge)
  • pull / linux-bionic-py3.8-clang9 / test (default, 2, 2, linux.2xlarge)
  • pull / linux-bionic-py3.8-clang9 / test (dynamo, 1, 2, linux.2xlarge)
  • pull / linux-bionic-py3.8-clang9 / test (dynamo, 2, 2, linux.2xlarge)
  • pull / linux-bionic-py3.8-clang9 / test (functorch, 1, 1, linux.2xlarge)
  • pull / linux-bionic-py3_8-clang8-xla / test (xla, 1, 1, linux.4xlarge)
  • pull / linux-focal-py3.8-gcc7 / test (backwards_compat, 1, 1, linux.2xlarge)
  • pull / linux-focal-py3.8-gcc7 / test (default, 1, 2, linux.2xlarge)
  • pull / linux-focal-py3.8-gcc7 / test (default, 2, 2, linux.2xlarge)
  • pull / linux-focal-py3.8-gcc7 / test (docs_test, 1, 1, linux.2xlarge)
  • pull / linux-focal-py3.8-gcc7 / test (functorch, 1, 1, linux.2xlarge)
  • pull / linux-focal-py3.8-gcc7 / test (jit_legacy, 1, 1, linux.2xlarge)
  • pull / linux-focal-py3.9-clang7-asan / test (default, 1, 5, linux.4xlarge)
  • pull / linux-focal-py3.9-clang7-asan / test (default, 2, 5, linux.4xlarge)
  • pull / linux-focal-py3.9-clang7-asan / test (default, 3, 5, linux.4xlarge)
  • pull / linux-focal-py3.9-clang7-asan / test (default, 4, 5, linux.4xlarge)
  • pull / linux-focal-py3.9-clang7-asan / test (default, 5, 5, linux.4xlarge)
  • pull / linux-focal-py3.9-clang7-asan / test (functorch, 1, 1, linux.2xlarge)
  • pull / linux-vulkan-bionic-py3.11-clang9 / test (default, 1, 1, linux.2xlarge)
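
The job names in these lists follow a visible pattern: three parts separated by " / ", with an optional parenthesised group on the last part. A minimal parsing sketch is shown here. The field names (`config`, `shard`, `num_shards`, `runner`) are assumptions read off the pattern, not something this thread states, and `parse_job_name` is a hypothetical helper.

```python
import re
from typing import NamedTuple, Optional


class JobName(NamedTuple):
    workflow: str
    build: str
    job: str
    # The four fields below are an assumed reading of "(a, i, n, label)".
    config: Optional[str]
    shard: Optional[int]
    num_shards: Optional[int]
    runner: Optional[str]


# Matches e.g. "test (crossref, 1, 2, linux.2xlarge)".
_MATRIX = re.compile(
    r"^(?P<job>[^()]+?)\s*"
    r"\((?P<config>[^,]+),\s*(?P<shard>\d+),\s*(?P<num_shards>\d+),\s*(?P<runner>[^)]+)\)$"
)


def parse_job_name(name: str) -> JobName:
    """Split a job name as listed in this thread into its parts."""
    parts = [p.strip() for p in name.split(" / ")]
    if len(parts) != 3:
        raise ValueError(f"expected 3 ' / '-separated parts, got {len(parts)}: {name!r}")
    workflow, build, last = parts
    m = _MATRIX.match(last)
    if m is None:
        # Names such as "linux-binary-conda / conda-py3_10-cpu-test / test"
        # carry no parenthesised group.
        return JobName(workflow, build, last, None, None, None, None)
    return JobName(
        workflow,
        build,
        m["job"],
        m["config"].strip(),
        int(m["shard"]),
        int(m["num_shards"]),
        m["runner"].strip(),
    )


if __name__ == "__main__":
    print(parse_job_name("pull / linux-bionic-py3.11-clang9 / test (crossref, 1, 2, linux.2xlarge)"))
    print(parse_job_name("linux-binary-conda / conda-py3_10-cpu-test / test"))
```

Grouping parsed names by `build` would make it easier to see that, for example, every `linux-bionic-py3.11-clang9` entry in the list above failed at once.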

github-actions[bot] • Mar 13 '23 07:03

These jobs started failing:

  • pull / linux-bionic-cuda11.7-py3.10-gcc7 / test (default, 1, 4, linux.4xlarge.nvidia.gpu)
  • pull / linux-bionic-cuda11.7-py3.10-gcc7 / test (default, 2, 4, linux.4xlarge.nvidia.gpu)
  • pull / linux-bionic-cuda11.7-py3.10-gcc7 / test (default, 3, 4, linux.4xlarge.nvidia.gpu)
  • pull / linux-bionic-cuda11.7-py3.10-gcc7 / test (default, 4, 4, linux.4xlarge.nvidia.gpu)
  • pull / linux-bionic-cuda11.7-py3.10-gcc7 / test (deploy, 1, 1, linux.4xlarge.nvidia.gpu)
  • pull / linux-bionic-cuda11.7-py3.10-gcc7 / test (distributed, 1, 3, linux.8xlarge.nvidia.gpu)
  • pull / linux-bionic-cuda11.7-py3.10-gcc7 / test (distributed, 2, 3, linux.8xlarge.nvidia.gpu)
  • pull / linux-bionic-cuda11.7-py3.10-gcc7 / test (functorch, 1, 1, linux.4xlarge.nvidia.gpu)
  • pull / linux-bionic-cuda11.7-py3.10-gcc7-bazel-test / build-and-test (default, 1, 1, linux.4xlarge.nvidia.gpu)
  • pull / linux-bionic-cuda11.7-py3.10-gcc7-sm86 / test (default, 1, 4, linux.g5.4xlarge.nvidia.gpu)
  • pull / linux-bionic-cuda11.7-py3.10-gcc7-sm86 / test (default, 2, 4, linux.g5.4xlarge.nvidia.gpu)
  • pull / linux-bionic-cuda11.7-py3.10-gcc7-sm86 / test (default, 3, 4, linux.g5.4xlarge.nvidia.gpu)
  • pull / linux-bionic-cuda11.7-py3.10-gcc7-sm86 / test (default, 4, 4, linux.g5.4xlarge.nvidia.gpu)
  • pull / linux-bionic-cuda11.7-py3.10-gcc7-sm86 / test (functorch, 1, 1, linux.g5.4xlarge.nvidia.gpu)
  • pull / linux-bionic-cuda11.7-py3.10-gcc7-sm86 / test (slow, 1, 2, linux.g5.4xlarge.nvidia.gpu)
  • pull / linux-bionic-cuda11.7-py3.10-gcc7-sm86 / test (slow, 2, 2, linux.g5.4xlarge.nvidia.gpu)

github-actions[bot] • Mar 13 '23 07:03

These jobs started failing:

  • pull / linux-bionic-cuda11.7-py3.10-gcc7 / test (distributed, 3, 3, linux.8xlarge.nvidia.gpu)

github-actions[bot] • Mar 13 '23 07:03

These jobs started failing:

  • pull / win-vs2019-cpu-py3 / test (default, 2, 2, windows.4xlarge)

github-actions[bot] • Mar 13 '23 08:03

These jobs started failing:

  • trunk / linux-bionic-py3.8-clang9-slow / test (slow, 1, 1, linux.2xlarge)

github-actions[bot] • Mar 17 '23 07:03

These jobs started failing:

  • trunk / linux-focal-py3.9-clang7-tsan / test (tsan, 1, 1, linux.2xlarge)
  • trunk / linux-focal-rocm5.4.2-py3.8 / test (default, 1, 2, linux.rocm.gpu)
  • trunk / linux-focal-rocm5.4.2-py3.8 / test (default, 2, 2, linux.rocm.gpu)

github-actions[bot] • Mar 17 '23 07:03

These jobs started failing:

  • trunk / linux-bionic-cuda11.7-py3.10-gcc7 / test (jit_legacy, 1, 1, linux.4xlarge.nvidia.gpu)
  • trunk / linux-bionic-cuda11.7-py3.10-gcc7 / test (nogpu_AVX512, 1, 1, linux.2xlarge)
  • trunk / linux-bionic-cuda11.7-py3.10-gcc7 / test (nogpu_NO_AVX2, 1, 1, linux.2xlarge)
  • trunk / linux-bionic-cuda11.8-py3.10-gcc7 / test (default, 3, 4, linux.4xlarge.nvidia.gpu)
  • trunk / linux-bionic-cuda11.8-py3.10-gcc7 / test (default, 4, 4, linux.4xlarge.nvidia.gpu)
  • trunk / linux-bionic-cuda11.8-py3.10-gcc7 / test (distributed, 1, 3, linux.8xlarge.nvidia.gpu)
  • trunk / linux-bionic-cuda11.8-py3.10-gcc7 / test (jit_legacy, 1, 1, linux.4xlarge.nvidia.gpu)
  • trunk / linux-bionic-cuda11.8-py3.10-gcc7 / test (nogpu_AVX512, 1, 1, linux.2xlarge)
  • trunk / linux-bionic-cuda11.8-py3.10-gcc7 / test (nogpu_NO_AVX2, 1, 1, linux.2xlarge)

github-actions[bot] • Mar 17 '23 07:03

These jobs started failing:

  • trunk / linux-bionic-cuda11.8-py3.10-gcc7 / test (default, 1, 4, linux.4xlarge.nvidia.gpu)
  • trunk / linux-bionic-cuda11.8-py3.10-gcc7 / test (default, 2, 4, linux.4xlarge.nvidia.gpu)
  • trunk / linux-bionic-cuda11.8-py3.10-gcc7 / test (distributed, 2, 3, linux.8xlarge.nvidia.gpu)
  • trunk / linux-bionic-cuda11.8-py3.10-gcc7 / test (distributed, 3, 3, linux.8xlarge.nvidia.gpu)
  • trunk / linux-bionic-cuda11.8-py3.10-gcc7 / test (functorch, 1, 1, linux.4xlarge.nvidia.gpu)

github-actions[bot] • Mar 17 '23 07:03

These jobs started failing:

  • trunk / macos-12-py3-arm64 / test (default, 1, 2, macos-m1-12)
  • trunk / macos-12-py3-arm64 / test (default, 2, 2, macos-m1-12)
  • trunk / macos-12-py3-arm64 / test (functorch, 1, 1, macos-m1-12)

github-actions[bot] • Mar 17 '23 11:03

These jobs started failing:

  • linux-binary-conda / conda-py3_10-cpu-test / test
  • linux-binary-conda / conda-py3_8-cpu-test / test
  • linux-binary-conda / conda-py3_9-cpu-test / test

github-actions[bot] • Mar 18 '23 07:03

These jobs started failing:

  • linux-binary-conda / conda-py3_11-cpu-test / test

github-actions[bot] • Mar 18 '23 07:03

These jobs started failing:

  • trunk / win-vs2019-cuda11.7-py3 / test (force_on_cpu, 1, 1, windows.4xlarge)

github-actions[bot] • Mar 18 '23 08:03

These jobs started failing:

  • linux-binary-conda / conda-py3_10-cuda11_7-test / test
  • linux-binary-conda / conda-py3_11-cuda11_7-test / test
  • linux-binary-conda / conda-py3_8-cuda11_7-test / test
  • linux-binary-conda / conda-py3_9-cuda11_7-test / test

github-actions[bot] • Mar 18 '23 09:03

These jobs started failing:

  • linux-binary-conda / conda-py3_10-cuda11_8-test / test
  • linux-binary-conda / conda-py3_11-cuda11_8-test / test
  • linux-binary-conda / conda-py3_8-cuda11_8-test / test
  • linux-binary-conda / conda-py3_9-cuda11_8-test / test

github-actions[bot] • Mar 18 '23 09:03

These jobs started failing:

  • trunk / win-vs2019-cuda11.7-py3 / test (default, 5, 5, windows.g5.4xlarge.nvidia.gpu)

github-actions[bot] • Mar 19 '23 09:03

These jobs stopped failing:

  • linux-binary-conda / conda-py3_11-cpu-test / test

github-actions[bot] • Mar 21 '23 04:03

These jobs started failing:

  • linux-binary-conda / conda-py3_11-cpu-test / test

github-actions[bot] • Mar 21 '23 05:03

These jobs stopped failing:

  • linux-binary-conda / conda-py3_11-cpu-test / test

github-actions[bot] • Mar 21 '23 05:03

These jobs stopped failing:

  • linux-binary-conda / conda-py3_10-cpu-test / test
  • linux-binary-conda / conda-py3_8-cpu-test / test
  • linux-binary-conda / conda-py3_9-cpu-test / test

github-actions[bot] • Mar 22 '23 07:03

These jobs stopped failing:

  • linux-binary-conda / conda-py3_11-cuda11_7-test / test

github-actions[bot] • Mar 22 '23 08:03

These jobs stopped failing:

  • linux-binary-conda / conda-py3_10-cuda11_7-test / test
  • linux-binary-conda / conda-py3_8-cuda11_7-test / test
  • linux-binary-conda / conda-py3_9-cuda11_7-test / test

github-actions[bot] • Mar 22 '23 09:03

These jobs stopped failing:

  • linux-binary-conda / conda-py3_10-cuda11_8-test / test
  • linux-binary-conda / conda-py3_11-cuda11_8-test / test
  • linux-binary-conda / conda-py3_9-cuda11_8-test / test

github-actions[bot] • Mar 22 '23 09:03

These jobs stopped failing:

  • linux-binary-conda / conda-py3_8-cuda11_8-test / test

github-actions[bot] • Mar 22 '23 11:03

These jobs started failing:

  • trunk / linux-bionic-cuda11.8-py3.10-gcc7 / test (default, 1, 5, linux.4xlarge.nvidia.gpu)
  • trunk / linux-bionic-cuda11.8-py3.10-gcc7 / test (default, 2, 5, linux.4xlarge.nvidia.gpu)
  • trunk / linux-bionic-cuda11.8-py3.10-gcc7 / test (default, 3, 5, linux.4xlarge.nvidia.gpu)
  • trunk / linux-bionic-cuda11.8-py3.10-gcc7 / test (default, 4, 5, linux.4xlarge.nvidia.gpu)
  • trunk / linux-bionic-cuda11.8-py3.10-gcc7 / test (default, 5, 5, linux.4xlarge.nvidia.gpu)

github-actions[bot] • Mar 23 '23 07:03

These jobs started failing:

  • trunk / macos-12-py3-arm64 / test (default, 1, 3, macos-m1-12)
  • trunk / macos-12-py3-arm64 / test (default, 2, 3, macos-m1-12)

github-actions[bot] • Mar 23 '23 11:03

These jobs started failing:

  • trunk / macos-12-py3-arm64 / test (default, 3, 3, macos-m1-12)

github-actions[bot] • Mar 23 '23 11:03

These jobs started failing:

  • trunk / linux-focal-rocm5.4.2-py3.8 / test (default, 1, 3, linux.rocm.gpu)
  • trunk / linux-focal-rocm5.4.2-py3.8 / test (default, 2, 3, linux.rocm.gpu)
  • trunk / linux-focal-rocm5.4.2-py3.8 / test (default, 3, 3, linux.rocm.gpu)

github-actions[bot] • Mar 24 '23 07:03

These jobs started failing:

  • trunk / win-vs2019-cuda11.7-py3 / test (default, 5, 6, windows.g5.4xlarge.nvidia.gpu)

github-actions[bot] • Mar 24 '23 09:03

These jobs started failing:

  • trunk / win-vs2019-cuda11.7-py3 / test (default, 6, 6, windows.g5.4xlarge.nvidia.gpu)

github-actions[bot] • Mar 25 '23 09:03

These jobs stopped failing:

  • trunk / win-vs2019-cuda11.7-py3 / test (default, 5, 6, windows.g5.4xlarge.nvidia.gpu)

github-actions[bot] • Mar 25 '23 09:03