spaCy icon indicating copy to clipboard operation
spaCy copied to clipboard

Add Kurdish Kurmanji language

Open cikay opened this issue 1 year ago • 2 comments

Description

I could not find a document on how to add a new language. I just checked what the existing ones have and added them for Kurdish Kurmanji. I may miss things. Please let me know and also if there is you could provide a doc link for it.

Types of change

Adding a new language Kurdish Kurmanji

Checklist

  • [x] I confirm that I have the right to submit this contribution under the project's MIT license.
  • [x] I ran the tests, and all new and existing tests passed.
  • [ ] My changes don't require a change to the documentation, or if they do, I've added all the required information.

cikay avatar Jul 08 '24 18:07 cikay

Here is a universal dependency corpus https://github.com/UniversalDependencies/UD_Kurmanji-MG for Kurmanji

cikay avatar Jul 08 '24 18:07 cikay

https://spacy.io/usage/linguistic-features#language-data Ok, I found it. Here, it is explained about language files and functions shortly.

cikay avatar Jul 28 '24 18:07 cikay

Thanks! I haven't reviewed it carefully but it doesn't seem problematic to include it. I don't think I'll have time to add the models just now though.

honnibal avatar Sep 09 '24 09:09 honnibal