open_llama
open_llama copied to clipboard
Corpora for Ukrainian
Hello.
There is a corpora call Ubercorpus for Ukrainian you can add to the project: https://lang.org.ua/en/corpora/#anchor4
In a few days will be UNLP, an event from Ukrianian NLP community and there will be presented the second version of the corpus with larger size.
Would be nice to have Ukrainian Ubercorpus to OpenLLama https://huggingface.co/openlm-research/open_llama_7b_preview_200bt/discussions/1