megatron-lm topic

List megatron-lm repositories

Annotated-ML-Papers

190
Stars
16
Forks
Watchers

Annotations of the interesting ML papers I read

pipegoose

77
Stars
17
Forks
Watchers

Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*

LLaMA-Megatron

26
Stars
2
Forks
Watchers

A LLaMA1/LLaMA12 Megatron implement.

Odysseus-Transformer

47
Stars
1
Forks
Watchers

Odysseus: Playground of LLM Sequence Parallelism

ReaLHF

95
Stars
4
Forks
Watchers

Super-Efficient RLHF Training of LLMs with Parameter Reallocation