model-parallization topic
model-parallization repositories
gdGPT
91 Stars
8 Forks
Watchers
Train LLMs (BLOOM, LLaMA, Baichuan2-7B, ChatGLM3-6B) with DeepSpeed pipeline parallelism. According to the repository, this is faster than ZeRO, ZeRO++, and FSDP.