FastChat icon indicating copy to clipboard operation
FastChat copied to clipboard

Have you considered Cerebras-GPT-13B

Open aamir-gmail opened this issue 2 years ago • 1 comments

Would this be suitable as the base model https://huggingface.co/cerebras/Cerebras-GPT-13B seems to be free of any licensing limitations

aamir-gmail avatar Apr 08 '23 05:04 aamir-gmail

We have done some study and our temporal conclusion is that the backbone model Cerebras-GPT has much lower quality than the llama, due to pertaining dataset size, and data filtering strategies.

See the page 4, footnote 1 of the Cerebras-GPT paper: https://arxiv.org/pdf/2304.03208.pdf

Hence we prefer to look into models like Flan-T5 at this moment.

More discussion is welcome.

zhisbug avatar Apr 12 '23 01:04 zhisbug

Closing as it is not planned.

zhisbug avatar Apr 20 '23 23:04 zhisbug

Meanwhile, we have supported Dolly-v2, koala, and we're considering open assistant and probably the stabilityLM

zhisbug avatar Apr 20 '23 23:04 zhisbug