deepspeed topic

List deepspeed repositories

gdGPT

91
Stars
8
Forks
Watchers

Train llm (bloom, llama, baichuan2-7b, chatglm3-6b) with deepspeed pipeline mode. Faster than zero/zero++/fsdp.

vod

21
Stars
3
Forks
Watchers

End-to-end training of Retrieval-Augmented LMs (REALM, RAG)

LLM-Pretrain-SFT

68
Stars
14
Forks
Watchers

Scripts of LLM pre-training and fine-tuning (w/wo LoRA, DeepSpeed)

llms_tool

201
Stars
18
Forks
Watchers

一个基于HuggingFace开发的大语言模型训练、测试工具。支持各模型的webui、终端预测,低参数量及全参数模型训练(预训练、SFT、RM、PPO、DPO)和融合、量化。

lmdeploy

4.5k
Stars
404
Forks
Watchers

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

train_law_llm

54
Stars
8
Forks
Watchers

✏️0成本LLM微调上手项目,⚡️一步一步使用colab训练法律LLM,基于microsoft/phi-1_5、chatglm3,包含lora微调,全参微调

Toy-RecLM

51
Stars
4
Forks
Watchers

A toy large model for recommender system based on LLaMA2/SASRec/Meta's generative recommenders. Besides, note and experiments of official implementation for Meta's generative recommenders.

llm-inference

69
Stars
17
Forks
Watchers

llm-inference is a platform for publishing and managing llm inference, providing a wide range of out-of-the-box features for model deployment, such as UI, RESTful API, auto-scaling, computing resource...

MPP-LLaVA

357
Stars
20
Forks
Watchers

Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conversations}. Don't let the poverty limit your imagination! Train...