DeepSpeed icon indicating copy to clipboard operation
DeepSpeed copied to clipboard

Clone tensors to avoid torch.save bloat

Open tjruwase opened this issue 2 years ago • 5 comments

Fixes #3303

tjruwase avatar Apr 22 '23 00:04 tjruwase

TODOs:

  1. Docs
  2. Unit tests (?)

tjruwase avatar Apr 22 '23 00:04 tjruwase

@stas00, please see docs https://deepspeed.readthedocs.io/en/rtd-staging/model-checkpointing.html#avoiding-zero-checkpoint-bloat

tjruwase avatar May 02 '23 19:05 tjruwase

Looking at the rendering - the source formatting appears to be borked. It has :param: and the last section doesn't show up.

And the doc is hard to read as it refers to input, let me try to make a better suggestion

stas00 avatar May 02 '23 19:05 stas00

@stas00, thanks for the feedback. I have applied your suggestions. Please take another look.

tjruwase avatar May 02 '23 21:05 tjruwase

Looking good now!

stas00 avatar May 02 '23 21:05 stas00