large-scale-language-modeling topic

List large-scale-language-modeling repositories

pipegoose

74
Stars
17
Forks
Watchers

Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*

VPGTrans

264
Stars
25
Forks
Watchers

Codes for VPGTrans: Transfer Visual Prompt Generator across LLMs. VL-LLaMA, VL-Vicuna.