open_llama icon indicating copy to clipboard operation
open_llama copied to clipboard

Corpora for Ukrainian

Open egorsmkv opened this issue 2 years ago • 1 comments

Hello.

There is a corpora call Ubercorpus for Ukrainian you can add to the project: https://lang.org.ua/en/corpora/#anchor4

In a few days will be UNLP, an event from Ukrianian NLP community and there will be presented the second version of the corpus with larger size.

egorsmkv avatar May 03 '23 11:05 egorsmkv

Would be nice to have Ukrainian Ubercorpus to OpenLLama https://huggingface.co/openlm-research/open_llama_7b_preview_200bt/discussions/1

podarok avatar May 03 '23 14:05 podarok