Uni-Core
Uni-Core copied to clipboard
Pytorch native optim
- Use
torch.optim.AdamW
as fallback Adam implementation. - Support selecting the fused versions of the optimizers (via
--use-fused-optimizer
).
Speed: custom_fused (only available for Adam) > fused > foreach