Jinghan Li
Results
2
comments of
Jinghan Li
> Does that mean if I use deepspeed integration with huggingface's `Trainer`, the parameters will be updated twice in each step? no, it won't.
https://github.com/huggingface/accelerate/pull/3819 I created this PR to debug parameters for a short time, feel free to use it. But even with it still heavily depends on the deepspeed engine.