Safeguarding against usage of uninitialized DistributedManager
Modulus Pull Request
Description
Closes #474.
Should be merged after #469.
Checklist
- [x] I am familiar with the Contributing Guidelines.
- [x] New or existing tests cover these changes.
- [x] The documentation is up to date with these changes.
- [x] The CHANGELOG.md is up to date with these changes.
- [x] An issue is linked to this pull request.
Dependencies
None
/blossom-ci
What it does makes sense. Is this fix supposed to safeguard against the case where the DM is initialized but an uninitialized distributed group is requested?
@azrael417 Not quite. This PR safeguards against using the manager before DistributedManager.initialize() has been called. There was a bug in CorrDiff where this happened silently, causing a multi-GPU job to behave like independent single-GPU jobs, since that is the default.
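To illustrate the kind of guard described above, here is a minimal sketch. It is not the code in this PR: `ManagerSketch` and `UninitializedManagerError` are hypothetical names, and reading `RANK` and `WORLD_SIZE` from the environment is an assumption about how the job is launched.

```python
import os


class UninitializedManagerError(RuntimeError):
    """Hypothetical error: the manager was used before initialize()."""


class ManagerSketch:
    """Toy stand-in for a singleton-style distributed manager."""

    # Class-level state shared by every instance.
    _initialized = False
    _rank = 0        # single-process defaults
    _world_size = 1

    @classmethod
    def initialize(cls):
        # Assumption: the launcher exports RANK and WORLD_SIZE.
        cls._rank = int(os.environ.get("RANK", 0))
        cls._world_size = int(os.environ.get("WORLD_SIZE", 1))
        cls._initialized = True

    def __init__(self):
        # Fail loudly instead of silently using the single-process defaults.
        if not type(self)._initialized:
            raise UninitializedManagerError(
                "Call ManagerSketch.initialize() before creating a ManagerSketch."
            )

    @property
    def rank(self):
        return type(self)._rank

    @property
    def world_size(self):
        return type(self)._world_size
```

Without the check in `__init__`, every process would see rank 0 and world size 1, which is the silent single-GPU behavior mentioned above.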