pykoi-rlhf-finetuned-transformers
pykoi-rlhf-finetuned-transformers copied to clipboard
SFT for D2L + Pre-Training (rename of the previous SFT)
Implement SFT and use D2L as a demo case. Rename previous SFT to Pre-training and modify corresponding scripts/notebooks.
Also, please add what you have tested for this PR.