ReplitLM
Does ReplitLM support gradient checkpointing?
Yes, gradient checkpointing should work out of the box.
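For anyone unsure what the feature does, here is a minimal sketch in plain PyTorch, not ReplitLM's own training code. `TinyBlock`, `TinyModel`, and `grads_for` are hypothetical names made up for illustration; the only real API used is `torch.utils.checkpoint.checkpoint`. The idea is that each block's activations are recomputed during the backward pass rather than stored, which trades extra compute for lower memory while leaving the gradients unchanged. How the option is switched on for ReplitLM depends on the training stack you use, which this thread does not specify.

```python
import torch
import torch.nn as nn
from torch.utils.checkpoint import checkpoint


class TinyBlock(nn.Module):
    """Hypothetical stand-in for one transformer block (not ReplitLM's layer)."""

    def __init__(self, dim):
        super().__init__()
        self.ff = nn.Sequential(
            nn.Linear(dim, 4 * dim), nn.GELU(), nn.Linear(4 * dim, dim)
        )

    def forward(self, x):
        # Residual connection around a small feed-forward network.
        return x + self.ff(x)


class TinyModel(nn.Module):
    """Stack of blocks with an optional gradient-checkpointing switch."""

    def __init__(self, dim=16, depth=4, use_checkpointing=False):
        super().__init__()
        self.blocks = nn.ModuleList([TinyBlock(dim) for _ in range(depth)])
        self.use_checkpointing = use_checkpointing

    def forward(self, x):
        for block in self.blocks:
            if self.use_checkpointing and self.training:
                # Do not keep this block's intermediate activations;
                # they are recomputed when backward() reaches the block.
                x = checkpoint(block, x, use_reentrant=False)
            else:
                x = block(x)
        return x


def grads_for(use_checkpointing):
    """Run one forward/backward pass and return the parameter gradients."""
    torch.manual_seed(0)  # same weights and inputs for both settings
    model = TinyModel(use_checkpointing=use_checkpointing)
    model.train()
    x = torch.randn(2, 8, 16)
    model(x).sum().backward()
    return [p.grad.clone() for p in model.parameters()]


if __name__ == "__main__":
    baseline = grads_for(use_checkpointing=False)
    checkpointed = grads_for(use_checkpointing=True)
    same = all(torch.allclose(a, b) for a, b in zip(baseline, checkpointed))
    print("gradients match:", same)
```

The memory saving only becomes noticeable on models with many large blocks; on a toy model like this one the sketch just demonstrates that training results are unaffected.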