deepspeed topic

List deepspeed repositories

l2hmc-qcd

64 Stars · 7 Forks

Application of the L2HMC algorithm to simulations in lattice QCD.

safe-rlhf

1.3k Stars · 119 Forks

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

RLHF

283 Stars · 35 Forks

Implementation of Chinese ChatGPT

transformers-language-modeling

20 Stars · 4 Forks

Train 🤗transformers with DeepSpeed: ZeRO-2, ZeRO-3

KnowLM

1.2k Stars · 123 Forks

An Open-Source Knowledgeable Large Language Model Framework.

OpenRLHF

2.1k Stars · 206 Forks

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

ChatGLM-LoRA-RLHF-PyTorch

125 Stars · 10 Forks

A full pipeline to finetune ChatGLM LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the ChatGLM architecture. Basically ChatG...

llama2-lora-fine-tuning

159 Stars · 14 Forks

Llama 2 fine-tuning with DeepSpeed and LoRA

Alpaca-LoRA-RLHF-PyTorch

56 Stars · 6 Forks

A full pipeline to finetune Alpaca LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the Alpaca architecture. Basically ChatGPT...

LLMs_train

36 Stars · 3 Forks

One codebase for instruction fine-tuning of large models