LLM-VM

Implement FSDP for training on large datasets

Open: TheRealVish opened this issue 1 year ago • 1 comment

Definition of done: Large models can be trained with FSDP (Fully Sharded Data Parallel) to accelerate training on large datasets.

Reference: https://pytorch.org/blog/introducing-pytorch-fully-sharded-data-parallel-api/
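For context, the core idea behind FSDP is that each worker keeps only a shard of the model parameters (and, in the real implementation, of the gradients and optimizer state) and gathers the full set only when it is needed. Below is a minimal single-process sketch of that idea, assuming plain Python lists in place of tensors and simulated ranks. `shard_flat_params` and `all_gather` are hypothetical helper names for illustration, not PyTorch functions, and this is not how the issue must be implemented.

```python
# Conceptual illustration only: plain Python lists stand in for tensors,
# and "ranks" are simulated inside one process. This is NOT the PyTorch FSDP API.


def shard_flat_params(flat_params, world_size):
    """Pad the flat parameter list so it divides evenly, then cut one shard per rank."""
    shard_size = -(-len(flat_params) // world_size)  # ceiling division
    padding = shard_size * world_size - len(flat_params)
    padded = flat_params + [0.0] * padding
    return [padded[r * shard_size:(r + 1) * shard_size] for r in range(world_size)]


def all_gather(shards, original_length):
    """Rebuild the full parameter list from every rank's shard and drop the padding."""
    full = [value for shard in shards for value in shard]
    return full[:original_length]


# Hypothetical 10-parameter "model" split across 4 simulated ranks.
params = [0.1 * i for i in range(10)]
shards = shard_flat_params(params, world_size=4)

# Each rank stores only its own shard between forward/backward passes ...
per_rank_sizes = [len(shard) for shard in shards]

# ... and the full parameters are reassembled only when a layer needs them.
restored = all_gather(shards, len(params))
```

With 4 ranks, each rank holds 3 values instead of all 10, which is where the memory saving comes from. An actual implementation would use the `FullyShardedDataParallel` wrapper from `torch.distributed.fsdp` inside an initialized process group, as described in the referenced blog post.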

TheRealVish, Sep 04 '23 13:09

Hi, can I please work on this issue? Thank you!

lucylililiwang, Feb 13 '24 16:02