zeus
zeus copied to clipboard
Training framework integration opportunities
-
PipelineFrequencyOptimizer: Large model training frameworks- Deepspeed
- Megatron-LM
- Deepspeed-Megatron
- GPT-NeoX
- Reuse training examples in each large model training framework (e.g., Llama pre-training or fine-tuning)
-
GlobalPowerLimitOptimizer- PyTorch FSDP
- Deepspeed ZeRO
PyTorch FSDP integration: #147