SpecForge
Support Training Eagle-3 with DeepSpeed
Support training Eagle-3 with DeepSpeed for large models such as 72B/235B.
Is the training speed improved compared to the original implementation?
We now have sgl online for training large models, so we can use SGLang as the backend to support different models.