wangcl
Results
2
repositories owned by
wangcl
DeepSpeed-Chat-Extension
15
Stars
1
Forks
Watchers
This repo contains some extensions of deepspeed-chat for fine-tuning LLMs (SFT+RLHF).
Vision-LLM-Alignment
40
Stars
1
Forks
Watchers
This repo contains the codes for supervised fine-tuning (SFT) and reinforcement learning from human feedback (RLHF) designed for vision LLMs.