Eagle icon indicating copy to clipboard operation
Eagle copied to clipboard

Finetuning on specific datasets

Open HashmatShadab opened this issue 1 year ago • 2 comments
trafficstars

Is there an option in the codebase to do the finetuning on only selected datasets mentioned in the readme?

HashmatShadab avatar Sep 06 '24 09:09 HashmatShadab

Yes, you can convert your dataset into LLaVA's format and update the data path in the script accordingly.

Essentially, you'll need to transform your annotations into a list of conversation data. For more details, please refer to the example annotation JSON file. If you run into any issues, feel free to reach out to us.

If you dataset is small, please consider using efficient tuning techniques like LoRA.

flyinglynx avatar Sep 10 '24 19:09 flyinglynx

Thank you for explaining. I was specifically talking about using specific datasets for finetuning that are mentioned in the readme. So for that i can just update the json file

HashmatShadab avatar Sep 10 '24 20:09 HashmatShadab