sequence-parallelism topic

List sequence-parallelism repositories

pipegoose

77
Stars
17
Forks
Watchers

Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*

InternEvo

300
Stars
51
Forks
Watchers

InternEvo is an open-sourced lightweight training framework aims to support model pre-training without the need for extensive dependencies.