open-tamil icon indicating copy to clipboard operation
open-tamil copied to clipboard

Open Source Tamil NLP Tools - தமிழ் இயற்கை மொழி பகுப்பாய்வு நிரல்தொகுப்பு

Results 83 open-tamil issues
Sort by recently updated
recently updated
newest added

SciKitLearn classifier is present in the examples of OpenTamil to classify if given sequence of Tamil letters is English transliteration or original Tamil letters. e.g. காழ்ப்பு -vs- பக்கெட்

enhancement

Collect Unigram data from Project Madurai, Wikipedia

enhancement

OpenTamilWebApp demo - numeral generation audio [m/f] voice generation can be shown via the HTML tag and WAV synthesis module in Open-Tamil examples code. It would be a nice example...

enhancement

Parallel corpora for Tamil. இது இருந்தால் "organic" என்ற சொல்லை "ஒர்கனிக்" என்று தமிழில் ஒலி மாற்றி எழுதினால், இதனை நாம் "இயற்கையான" என்று தமிழ்படுத்துவதற்கு உதவும்.

enhancement

![sandhi checker issue](https://user-images.githubusercontent.com/20999104/41199837-cae94c88-6cb6-11e8-9394-74b263c960b4.png)

bug

http://www.quillpad.in gives better transliteration for tamil. its source is here. https://github.com/CognirelTech/Quillpad-Server Make use of its xml files and rules to add better transliteration for open-tamil

enhancement

please make NLTK pos taggers and corpus reader etc... for tamil language.

enhancement

Continue work started by @arulalant to add reST docs for open-tamil classes and methods. Currently the following modules are exported to user as API via pip install. 1. tamil 2....

exact translation of english to tamil and tamil to english . not like google translator.

Auto lang mode needs a list of most common Tamil words https://github.com/emacsmirror/auto-lang