wangcl

2 repositories owned by wangcl

DeepSpeed-Chat-Extension

Stars: 15
Forks: 1
Watchers

This repo contains extensions to DeepSpeed-Chat for fine-tuning LLMs (SFT and RLHF).

Vision-LLM-Alignment

Stars: 40
Forks: 1
Watchers

This repo contains code for supervised fine-tuning (SFT) and reinforcement learning from human feedback (RLHF) of vision LLMs.