sagemaker-debugger

Updated distributed_training.md

Open NihalHarish opened this issue 5 years ago • 2 comments

Description of changes:

  • Updated docs: docs/distributed_training.md
  • Still WIP: docs and examples for XGBoost need to be added
  • More examples to cover missing cases:
    • TF 1.x Horovod
    • TF 1.x Mirrored Strategy
    • TF 2.x Horovod Keras Fit API
    • TF 2.x Horovod Gradient Tape API
    • TF 2.x Mirrored Strategy
    • PyTorch Horovod Example
    • PyTorch distributed training
    • MXNet Horovod

Style and formatting:

I have run pre-commit install to ensure that auto-formatting happens with every commit.

Issue number, if available

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

NihalHarish, Apr 21 '20 10:04

@NihalHarish Please make the changes that we discussed last week offline.

Vikas-kum, Apr 27 '20 20:04

@NihalHarish Please make the changes that we discussed last week offline.

Changes discussed offline: Remove ZCC examples that need additional configuration,

NihalHarish, Apr 27 '20 22:04