ColossalAI [DOC]: optimize readme of ChatGPT

📚 The doc issue

A new user is hard to start training after reading the readme.

I list some FAQs which users may concern:

How to prepare training dataset? What does the dataset look like?
How to save models?
How to use the fine-tuned models to generate?
How to use LoRA? How does it affect training?

Feb 15 '23 06:02 ver217

The biggest question in my mind is: How to initialize optimizer states when finetuning a pre-trained model?

Feb 15 '23 06:02 feifeibear

Hi, I found this picture a little confusing. I think if there is some description, it would be much better.

Feb 15 '23 06:02 Gy-Lu

Hi, I found this picture a little confusing. I think if there is some description, it would be much better.

Yes, at least tell the readers: the green part is the trainable parameters and the rest parts are frozen parameters.

Feb 15 '23 06:02 feifeibear

I think we need a detail blog to tell user what is reward model, critic model, sft-model

Feb 15 '23 06:02 Fazziekey

This picture is from InstructGPT, maybe there should be copyright information?

Feb 15 '23 06:02 Gy-Lu

I think instructs LM is better than chatgpt, chatgpt is a serving, instruct LM is an algorithm

Feb 15 '23 06:02 Fazziekey

This picture is from InstructGPT, maybe there should be copyright information?

Thanks for your remind, and we mention the reference here.

Feb 15 '23 07:02 ht-zhou

##Todo

implement PPO training and fine-tuning

We should also mentioned PPO fine-tuning. This looks a lot like we haven't implemented finetune function yet, because we plan to implement PPO-ptx fine-tuning.

Feb 15 '23 08:02 binmakeswell

c56c4ea7-c449-40ba-8b53-84d6cb9e647c Clear error message or doc to remind user 'pip install .'

Feb 15 '23 09:02 binmakeswell

TODO ckpt Seriously affect the actual use of the user, need to be fixed as a priority

Feb 15 '23 10:02 binmakeswell

Hi, I found this picture a little confusing. I think if there is some description, it would be much better.

Yes, at least tell the readers: the green part is the trainable parameters and the rest parts are frozen parameters.

I've already completed a detailed doc to explain about the training process, it will be released soon.

Feb 15 '23 11:02 ht-zhou

We have updated a lot. Please check https://github.com/hpcaitech/ColossalAI/tree/main/applications/Chat This issue was closed due to inactivity. Thanks.

Apr 18 '23 09:04 binmakeswell

ColossalAI ColossalAI copied to clipboard

[DOC]: optimize readme of ChatGPT

📚 The doc issue

ColossalAI
ColossalAI copied to clipboard