open-tamil
open-tamil copied to clipboard
Open Source Tamil NLP Tools - தமிழ் இயற்கை மொழி பகுப்பாய்வு நிரல்தொகுப்பு
SciKitLearn classifier is present in the examples of OpenTamil to classify if given sequence of Tamil letters is English transliteration or original Tamil letters. e.g. காழ்ப்பு -vs- பக்கெட்
Collect Unigram data from Project Madurai, Wikipedia
OpenTamilWebApp demo - numeral generation audio [m/f] voice generation can be shown via the HTML tag and WAV synthesis module in Open-Tamil examples code. It would be a nice example...
Parallel corpora for Tamil. இது இருந்தால் "organic" என்ற சொல்லை "ஒர்கனிக்" என்று தமிழில் ஒலி மாற்றி எழுதினால், இதனை நாம் "இயற்கையான" என்று தமிழ்படுத்துவதற்கு உதவும்.

http://www.quillpad.in gives better transliteration for tamil. its source is here. https://github.com/CognirelTech/Quillpad-Server Make use of its xml files and rules to add better transliteration for open-tamil
please make NLTK pos taggers and corpus reader etc... for tamil language.
Continue work started by @arulalant to add reST docs for open-tamil classes and methods. Currently the following modules are exported to user as API via pip install. 1. tamil 2....
exact translation of english to tamil and tamil to english . not like google translator.
Auto lang mode needs a list of most common Tamil words https://github.com/emacsmirror/auto-lang