Satish Pasumarthi

Results: 10 issues by Satish Pasumarthi

### System information

- **OS Platform and Distribution (e.g., Linux Ubuntu 16.04)**: Ubuntu 16.04
- **TensorFlow Serving installed from (source or binary)**: source
- **TensorFlow Serving version**: 2.2.0-rc2

### Describe...

type:build/install
stat:awaiting response

*Issue #, if available:* ORTE has lost communication with a remote daemon

*Description of changes:*

- Made changes to the non-leader nodes to sleep rather than wait on `orted` process....

*Issue #, if available:*

*Description of changes:*

- Add native pytorch DDP support
- Add support for py39
- Connected PRs https://github.com/aws/sagemaker-python-sdk/pull/2705 https://github.com/aws/sagemaker-pytorch-training-toolkit/pull/231
- Rename `NCCL_MIN_NRINGS` to `NCCL_MIN_NCHANNELS`
- Make...

Thank you for taking the time to submit an issue!

## Background information

### What version of Open MPI are you using? (e.g., v3.0.5, v4.0.2, git branch name and hash,...

question
Target: v4.1.x

*GitHub Issue #, if available:* Note: If merging this PR should also close the associated Issue, please also add that Issue # to the Linked Issues section on the right....

build
pytorch
Size:S

*GitHub Issue #, if available:* Note: If merging this PR should also close the associated Issue, please also add that Issue # to the Linked Issues section on the right....

build
pytorch
Size:S

*GitHub Issue #, if available:* Note: If merging this PR should also close the associated Issue, please also add that Issue # to the Linked Issues section on the right....

build
pytorch
Size:S

*Issue #, if available:*

*Description of changes:*

- Add support for torch_distributed (`torchrun`) distribution strategy for trainium instances in SageMaker.
- PySDK PR: https://github.com/aws/sagemaker-python-sdk/pull/3424
- PT Toolkit PR: https://github.com/aws/sagemaker-pytorch-training-toolkit/pull/248
-...

*Issue #, if available:*

*Description of changes:* Update documentation

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.

/kind bug

**What happened?**

Hi, I've been looking into getting the PV and PVC set up for pods using FSx in a peered account. I am unable to get the...

kind/bug
lifecycle/frozen