TextBlob
TextBlob copied to clipboard
Maintainers needed for language extensions
Help free TextBlob from Anglocentrism by contributing a language extension!
For instructions on how to do so, check out the development docs: http://textblob.readthedocs.org/en/dev/contributing.html#language-extensions
I've begun textblob-fr, but it would be nice if it had another maintainer (preferably one who knows French and/or has experience with NLP in French). For now, you can use textblob-fr as a template for other language extensions.
If you write more on what this would involve, I'd be interested in handling the Turkish extensions.
That would be excellent! I've added more info in the issue description
A tutorial on how to do this would be great. This is all the guidance we have right now (from the link above): "The process for developing language extensions is the same as developing model extensions. Create your part-of-speech taggers, tokenizers, parsers, etc. in the language of your choice."
Hi @sloria, I am willing to help on Arabic (if needed).
@ibrahimsharaf Excellent! AFAIK there is no Arabic extension for TextBlob.
I recommend checking out textblob-de and textblob-fr to see what an extension would look like.
Hi @sloria I contacted @ibrahimsharaf and we will work together on the arabic extension. I already wrote some updates on a local repo. Would it be better if the ar extension has a separate repo like fr and de or made internally like the en?
@adhaamehab It should be a separate repo/package so that it can be released independently of TextBlob's release schedule.
I’d also like to point out the SCOWL project which could be used to generate the en-spelling.txt file instead of maintaining one as part of this project. Similar dictionaries exist for other languages over at the aspell project.