deepspeed topics

A full pipeline to fine-tune the ChatGLM LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning from Human Feedback) on top of the ChatGLM architecture. Basically ChatG...

jackaduma

chatglm

chatglm-6b

chatgpt

deepspeed

llama2-lora-fine-tuning

159 Stars

14 Forks

Llama 2 fine-tuning with DeepSpeed and LoRA

git-cloner

deepspeed

finetuning

llama2

lora

Alpaca-LoRA-RLHF-PyTorch

56 Stars

6 Forks

A full pipeline to fine-tune the Alpaca LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning from Human Feedback) on top of the Alpaca architecture. Basically ChatGPT...

jackaduma

alpaca

chatgpt

deepspeed

finetune

LLMs_train

36 Stars

3 Forks

A single codebase for instruction fine-tuning large models

5663015

baichuan

bloom

chatglm-6b

deepspeed