LMFlow icon indicating copy to clipboard operation
LMFlow copied to clipboard

License for the non-LLaMA based models

Open Acrobot opened this issue 2 years ago • 1 comments

Hi, I can see that you released models that are not based on LLaMA. What datasets were they trained on, though? I can see in data/download.sh you download datasets such as Alpaca, which is explicitly not approved for commercial use. However, in your README, you mention "🚀Release Robin-7B (based on LLaMA-7B), and two models for commercial use: Parakeets-2.7B (based on GPT-NEO-2.7B) and Cokatoo-7B (based on StableLM-7B)". Are you sure that you are not using any datasets that are not approved for commercial use in those models?

Acrobot avatar Apr 26 '23 13:04 Acrobot

Hi, Thank you for your interest!

For these two models, we use our own curated dataset instead of Alpaca data. The dataset will be released later. Feel free to use them under Apache 2.0 license.

shizhediao avatar Apr 26 '23 13:04 shizhediao

This issue has been marked as stale because it has not had recent activity. If you think this still needs to be addressed please feel free to reopen this issue. Thanks

shizhediao avatar Jun 19 '23 11:06 shizhediao