fsdp topic

List fsdp repositories

podplex

17
Stars
4
Forks
Watchers

πŸ¦ΎπŸ’»πŸŒ distributed training & serverless inference at scale on RunPod

distributed-training-guide

540
Stars
55
Forks
540
Watchers

Best practices & guides on how to write distributed pytorch training code

META LLAMA3 GENAI Real World UseCases End To End Implementation Guide

torchft

455
Stars
51
Forks
455
Watchers

Fault tolerance for PyTorch (HSDP, LocalSGD, DiLoCo, Streaming DiLoCo)

SpecForge

500
Stars
112
Forks
500
Watchers

Train speculative decoding models effortlessly and port them smoothly to SGLang serving.