DeepSpeed
Improve overflow handling in ZeRO
Fix #5241: Improve overflow handling
- [x] ZeRO 1
- [x] ZeRO 2
- [ ] ZeRO 3
- [ ] BF16Optimizer
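For readers unfamiliar with what overflow handling involves, here is a minimal sketch of the general fp16 pattern: detect inf/NaN gradients, skip the step, and shrink the loss scale. It is not the code in this PR or in DeepSpeed's ZeRO optimizers. `has_overflow`, `DynamicLossScaler`, and all field names and default values are hypothetical, and plain Python floats stand in for gradient tensors.

```python
import math
from dataclasses import dataclass


def has_overflow(grads):
    """Return True if any gradient value is inf or NaN."""
    return any(not math.isfinite(g) for g in grads)


@dataclass
class DynamicLossScaler:
    # Hypothetical defaults, chosen only for illustration.
    scale: float = 2.0 ** 16
    scale_factor: float = 2.0
    scale_window: int = 1000
    min_scale: float = 1.0
    good_steps: int = 0

    def update(self, overflow: bool) -> bool:
        """Adjust the scale; return True if the optimizer step should run."""
        if overflow:
            # Overflow: shrink the scale (not below min_scale) and skip the step.
            self.scale = max(self.scale / self.scale_factor, self.min_scale)
            self.good_steps = 0
            return False
        # Clean step: grow the scale after every scale_window clean steps in a row.
        self.good_steps += 1
        if self.good_steps % self.scale_window == 0:
            self.scale *= self.scale_factor
        return True
```

In a training loop under these assumptions, the optimizer step would run only when `scaler.update(has_overflow(grads))` returns `True`.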
Enable pydantic configuration for mixed precision
- [x] bf16
- [x] fp16
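As a rough illustration of what a pydantic-based mixed-precision configuration could look like, the sketch below validates `fp16` and `bf16` sections of a config dictionary. It is not the model added by this PR. The class names, the `parse_precision` helper, the chosen subset of fields and their defaults, and the rule that both modes cannot be enabled together are assumptions made for the example.

```python
from pydantic import BaseModel, Field, ValidationError


class FP16Config(BaseModel):
    # Hypothetical model; fields and defaults are assumptions for illustration.
    enabled: bool = False
    loss_scale: float = Field(0.0, ge=0.0)  # 0 is taken here to mean dynamic scaling
    initial_scale_power: int = Field(16, ge=0)
    loss_scale_window: int = Field(1000, gt=0)
    min_loss_scale: float = Field(1.0, gt=0.0)


class BF16Config(BaseModel):
    # Hypothetical model with a single switch.
    enabled: bool = False


def parse_precision(config: dict):
    """Validate the 'fp16' and 'bf16' sections of a config dictionary."""
    fp16 = FP16Config(**config.get("fp16", {}))
    bf16 = BF16Config(**config.get("bf16", {}))
    # Assumed rule for this sketch: the two modes are mutually exclusive.
    if fp16.enabled and bf16.enabled:
        raise ValueError("fp16 and bf16 cannot both be enabled")
    return fp16, bf16
```

With typed models like these, an invalid value such as `loss_scale_window=0` raises a `ValidationError` at parse time instead of surfacing later during training.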
@delock, @inkcherry, can you please help investigate the failing xpu-max1100 CI? Thanks!
@tjruwase thanks! Our engineer is looking into it.
Any ETA on this for merge?
Since CI now looks fine, this should be merged by 06/13/25. Thanks for your patience.