Will instruction fine-tuned models be made available as well?

Open allthingssecurity opened this issue 2 years ago • 8 comments

Will instruction fine-tuned models be made available for this as well?

allthingssecurity avatar May 09 '23 02:05 allthingssecurity

@allthingssecurity All instruction-finetuned models on FLAN will be made publicly available as well.

conceptofmind avatar May 09 '23 02:05 conceptofmind

Thanks for such a quick reply. When can we expect the same?

allthingssecurity avatar May 09 '23 02:05 allthingssecurity

> Thanks for such a quick reply. When can we expect the same?

The 2.1b model is training now. 2b won't be done for days. So after that finishes I will start training all of the flan-PaLM models.

conceptofmind avatar May 09 '23 02:05 conceptofmind

Please let me know if I can help. I can dedicate some compute to it. I have already done some fine-tuning of Flan models in the past.

allthingssecurity avatar May 09 '23 04:05 allthingssecurity

Will there be bigger models than 2B?

Njasa2k avatar May 09 '23 13:05 Njasa2k

> Will there be bigger models than 2B?

That is largely dependent on whether CarperAI and StabilityAI want to pursue larger training runs for PaLM. I can say that there is a plan to train a much larger Sparrow model similar to PaLM with RLHF on more tokens.

You can join the CarperAI discord to follow the projects: https://discord.gg/canadagoose

conceptofmind avatar May 09 '23 14:05 conceptofmind

Hello Team,

First of all, great work and this is super helpful for the open source community. I wanted to check if there are any updates on the instruction fine-tuned models?

Thanks.

varunnathan avatar Apr 06 '24 03:04 varunnathan

I do not have access to any GPUs, so no models will be trained.

conceptofmind avatar Apr 06 '24 16:04 conceptofmind