OpenChatKit What kind of data to feed to the model ?

What kind of data to feed to the model ?

Open dimwael opened this issue 2 years ago • 2 comments

What kind of data to provide for finetuning ? What are the best practices for finetuning ?

Mar 13 '23 22:03 dimwael

The type of data required for fine-tuning depends on the task the model is intended to perform.

Best practices for fine-tuning include:

Using a similar data distribution as the original data
Using enough data to ensure the model can generalize well
Using multiple epochs to allow the model to learn from the new data
Using a small learning rate to prevent overfitting
Monitoring the model's performance on a validation set
Using a pre-trained model for better performance
Regularizing the model to prevent overfitting

Mar 14 '23 12:03 Rartg

If you're trying to reproduce the GPT-NeoXT-Chat-Base-20B model, you can download the dataset by running python data/OIG/prepare.py from the root of the repository.

We plan to add more documentation about fine tuning on your own data. Please see #10.

Mar 18 '23 05:03 csris

OpenChatKit OpenChatKit copied to clipboard

What kind of data to feed to the model ?

OpenChatKit
OpenChatKit copied to clipboard