deepspeed topic

List deepspeed repositories

gdGPT

91

Stars

8

Forks

Watchers

Train llm (bloom, llama, baichuan2-7b, chatglm3-6b) with deepspeed pipeline mode. Faster than zero/zero++/fsdp.

vod

21

Stars

3

Forks

Watchers

End-to-end training of Retrieval-Augmented LMs (REALM, RAG)

LLM-Pretrain-SFT

68

Stars

14

Forks

Watchers

Scripts of LLM pre-training and fine-tuning (w/wo LoRA, DeepSpeed)

large-language-models

llms_tool

201

Stars

18

Forks

Watchers

一个基于HuggingFace开发的大语言模型训练、测试工具。支持各模型的webui、终端预测，低参数量及全参数模型训练(预训练、SFT、RM、PPO、DPO)和融合、量化。

lmdeploy

4.5k

Stars

404

Forks

Watchers

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

fastertransformer

my-llm

46

Stars

5

Forks

Watchers

All about large language models

distributed-training

large-language-models

train_law_llm

54

Stars

8

Forks

Watchers

✏️0成本LLM微调上手项目，⚡️一步一步使用colab训练法律LLM，基于microsoft/phi-1_5、chatglm3，包含lora微调，全参微调

Toy-RecLM

51

Stars

4

Forks

Watchers

A toy large model for recommender system based on LLaMA2/SASRec/Meta's generative recommenders. Besides, note and experiments of official implementation for Meta's generative recommenders.

actions-speak-louder-than-words

large-language-models

llm-inference

69

Stars

17

Forks

Watchers

llm-inference is a platform for publishing and managing llm inference, providing a wide range of out-of-the-box features for model deployment, such as UI, RESTful API, auto-scaling, computing resource...

MPP-LLaVA

357

Stars

20

Forks

Watchers

Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conversations}. Don't let the poverty limit your imagination! Train...