DeepSpeed
DeepSpeed copied to clipboard
New integration - CometMonitor
This PR introduces a new monitoring option - CometMonitor which comes up as an official integration with CometML.
The new monitor is covered with unit tests.
Notes:
- We've updated
docs/code-docs/source/monitor.rstbut it doesn't look used anymore - We've updated the "Monitoring Module" section name in
config-json.mdto be generic so the next integration won't require updating it.
@microsoft-github-policy-service agree company="Comet"
Hi, @loadams! I hope I fixed the issues in my unit tests (they revealed themselves only in multi-gpu env). Could you please re-run the tests?
Hi @loadams @alexkuzmik , @deepcharm confirmed a bug in this PR when determining the config monitor.enabled flag, and fixed it in new PR https://github.com/microsoft/DeepSpeed/pull/5633