pytorch
pytorch copied to clipboard
[DCP] Adds utility for converting torch save to dcp
Stack from ghstack (oldest at bottom):
- #120128
- #120127
- #119816
- -> #119815
- #119814
- #119813
as title
Differential Revision: D53718040
cc @mrshenli @pritamdamania87 @zhaojuanmao @satgera @rohan-varma @gqchen @aazzolini @osalpekar @jiayisuse @H-Huang @kwen2501 @awgu @penguinwu @fegin @XilunWu @wanchaol @fduwjj @wz337 @tianyu-l @wconstab @yf225
:link: Helpful Links
:test_tube: See artifacts and rendered test results at hud.pytorch.org/pr/119815
- :page_facing_up: Preview Python docs built from this PR
- :page_facing_up: Preview C++ docs built from this PR
- :question: Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours
Note: Links to docs will display an error until the docs builds have been completed.
:white_check_mark: You can merge normally! (3 Unrelated Failures)
As of commit 82f2ff43c77dde8c3f66b1dd92c48c8be9593fa7 with merge base c9459e7f555e645badc4803ecde1a3bde1f6b859 ():
FLAKY - The following job failed but was likely due to flakiness present on trunk:
-
trunk / macos-12-py3-arm64 / test (default, 3, 3, macos-m1-12) (gh)
dynamo/test_trace_rules.py::TraceRuleTests::test_torch_name_rule_map_updated
UNSTABLE - The following jobs failed but were likely due to flakiness present on trunk and has been marked as unstable:
-
periodic / linux-focal-rocm6.0-py3.8 / test (distributed, 1, 2, linux.rocm.gpu, unstable) (gh)
distributed/_composable/fsdp/test_fully_shard_training.py::TestFullyShard1DTrainingCompose::test_train_parity_with_activation_checkpointing
-
periodic / linux-focal-rocm6.0-py3.8 / test (distributed, 2, 2, linux.rocm.gpu, unstable) (gh)
distributed/test_c10d_nccl.py::NCCLTraceTestDumpOnTimeout::test_timeout_dumps_timing_enabled_True
This comment was automatically generated by Dr. CI and updates every 15 minutes.
This pull request was exported from Phabricator. Differential Revision: D53718040
This pull request was exported from Phabricator. Differential Revision: D53718040
This pull request was exported from Phabricator. Differential Revision: D53718040
This pull request was exported from Phabricator. Differential Revision: D53718040
This pull request was exported from Phabricator. Differential Revision: D53718040
This pull request was exported from Phabricator. Differential Revision: D53718040
@pytorchbot merge
Merge failed
Reason: This PR needs a release notes:
label
If your changes are user facing and intended to be a part of release notes, please use a label starting with release notes:
.
If not, please add the topic: not user facing
label.
To add a label, you can comment to pytorchbot, for example
@pytorchbot label "topic: not user facing"
For more information, see https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.
Details for Dev Infra team
Raised by workflow job
@pytorchbot merge
Merge started
Your change will be merged once all checks pass (ETA 0-4 Hours).
Learn more about merging in the wiki.
Questions? Feedback? Please reach out to the PyTorch DevX TeamAdvanced Debugging
Check the merge workflow status
here
Merge failed
Reason: Command git -C /home/runner/work/pytorch/pytorch cherry-pick -x faacbd37df359b57806dc61874e0deeced2e69b5
returned non-zero exit code 1
Auto-merging docs/source/distributed.checkpoint.rst
CONFLICT (content): Merge conflict in docs/source/distributed.checkpoint.rst
CONFLICT (modify/delete): test/distributed/checkpoint/test_format_utils.py deleted in HEAD and modified in faacbd37df3 ([DCP] Adds utility for converting torch save to dcp). Version faacbd37df3 ([DCP] Adds utility for converting torch save to dcp) of test/distributed/checkpoint/test_format_utils.py left in tree.
CONFLICT (modify/delete): torch/distributed/checkpoint/format_utils.py deleted in HEAD and modified in faacbd37df3 ([DCP] Adds utility for converting torch save to dcp). Version faacbd37df3 ([DCP] Adds utility for converting torch save to dcp) of torch/distributed/checkpoint/format_utils.py left in tree.
error: could not apply faacbd37df3... [DCP] Adds utility for converting torch save to dcp
hint: After resolving the conflicts, mark them with
hint: "git add/rm <pathspec>", then run
hint: "git cherry-pick --continue".
hint: You can instead skip this commit with "git cherry-pick --skip".
hint: To abort and get back to the state before "git cherry-pick",
hint: run "git cherry-pick --abort".
Details for Dev Infra team
Raised by workflow job
@pytorchbot merge
Merge started
Your change will be merged once all checks pass (ETA 0-4 Hours).
Learn more about merging in the wiki.
Questions? Feedback? Please reach out to the PyTorch DevX TeamAdvanced Debugging
Check the merge workflow status
here