NeSVoR Efficiency with deepspeed

Efficiency with deepspeed

Open jennydaman opened this issue 2 years ago • 1 comments

Possibly naive suggestion, I wonder if we can lower VRAM usage and/or improve speed using https://github.com/microsoft/DeepSpeed ?

Jun 23 '23 23:06 jennydaman

Thanks for the suggestion. Actually we have already used some of those techniques, e.g., mixed precision training, to improve the usage of GPU memory and efficiency. Other techniques, such as offloading, are useful for large models but might not be necessary in our case. But there might be some new techniques that I am not aware of, so I will keep an eye on it.

Jul 01 '23 14:07 daviddmc

NeSVoR NeSVoR copied to clipboard

Efficiency with deepspeed

NeSVoR
NeSVoR copied to clipboard