NeSVoR icon indicating copy to clipboard operation
NeSVoR copied to clipboard

Efficiency with deepspeed

Open jennydaman opened this issue 2 years ago • 1 comments

Possibly naive suggestion, I wonder if we can lower VRAM usage and/or improve speed using https://github.com/microsoft/DeepSpeed ?

jennydaman avatar Jun 23 '23 23:06 jennydaman

Thanks for the suggestion. Actually we have already used some of those techniques, e.g., mixed precision training, to improve the usage of GPU memory and efficiency. Other techniques, such as offloading, are useful for large models but might not be necessary in our case. But there might be some new techniques that I am not aware of, so I will keep an eye on it.

daviddmc avatar Jul 01 '23 14:07 daviddmc