wordfreq icon indicating copy to clipboard operation
wordfreq copied to clipboard

Adding new language "Basque"

Open Mikelhoya opened this issue 3 years ago • 2 comments

Hello my name is Mikel, I would like to know if there is any possibility of adding a new language to the library. The Basque language. And if the answer is yes how could i colaborate to make it happen. Thank you

Mikelhoya avatar Jul 07 '22 11:07 Mikelhoya

Last time I updated the input corpora, Basque just missed the cutoff for having enough text for me to consider the frequencies representative. I had left myself a note that if I finished including corpus text from OSCAR, it would enable word frequencies in Basque, as well as Estonian, Albanian, and Galician.

The company I work for now is focused on monolingual English, and I may not have an opportunity to do more multilingual corpus processing any time soon, though I really wish I could.

rspeer avatar Oct 14 '22 16:10 rspeer

Thank you very much, it isn´t urgent for me now. You are doing a good job. Thank you

Mikelhoya avatar Oct 14 '22 17:10 Mikelhoya

Closing because the wordfreq data is unlikely to be updated in any language.

rspeer avatar Jun 25 '24 14:06 rspeer