megatron topic

List megatron repositories

megadlbot_oss

178
Stars
170
Forks
Watchers

Megatron was a telegram file management bot that helped a lot of users, specially movie channel managers to upload their files to telegram by just providing a link to it. The project initially started...

ms-swift

3.6k
Stars
310
Forks
12
Watchers

Use PEFT or Full-parameter to finetune 350+ LLMs or 90+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Visio...

pipegoose

77
Stars
17
Forks
Watchers

Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*

LLaMA-Megatron

26
Stars
2
Forks
Watchers

A LLaMA1/LLaMA12 Megatron implement.