ColossalAI
ColossalAI copied to clipboard
[DOC]: optimize readme of ChatGPT
📚 The doc issue
A new user is hard to start training after reading the readme.
I list some FAQs which users may concern:
- How to prepare training dataset? What does the dataset look like?
- How to save models?
- How to use the fine-tuned models to generate?
- How to use LoRA? How does it affect training?
The biggest question in my mind is: How to initialize optimizer states when finetuning a pre-trained model?
Hi, I found this picture a little confusing. I think if there is some description, it would be much better.
Hi, I found this picture a little confusing. I think if there is some description, it would be much better.
Yes, at least tell the readers: the green part is the trainable parameters and the rest parts are frozen parameters.
I think we need a detail blog to tell user what is reward model, critic model, sft-model
This picture is from InstructGPT, maybe there should be copyright information?
I think instructs LM is better than chatgpt, chatgpt is a serving, instruct LM is an algorithm
This picture is from InstructGPT, maybe there should be copyright information?
data:image/s3,"s3://crabby-images/f706f/f706f6d3dc118758319926eaef3002bdad8b0cc6" alt="image"
##Todo
- implement PPO training and fine-tuning
We should also mentioned PPO fine-tuning. This looks a lot like we haven't implemented finetune function yet, because we plan to implement PPO-ptx fine-tuning.
data:image/s3,"s3://crabby-images/973b0/973b0fb95e34c6c93785ab0050c896ea758b1236" alt="c56c4ea7-c449-40ba-8b53-84d6cb9e647c"
TODO ckpt Seriously affect the actual use of the user, need to be fixed as a priority
Hi, I found this picture a little confusing. I think if there is some description, it would be much better.
Yes, at least tell the readers: the green part is the trainable parameters and the rest parts are frozen parameters.
I've already completed a detailed doc to explain about the training process, it will be released soon.
We have updated a lot. Please check https://github.com/hpcaitech/ColossalAI/tree/main/applications/Chat This issue was closed due to inactivity. Thanks.