pythainlp
Thai Natural Language Processing in Python.
## Detailed description Based on https://github.com/PyThaiNLP/thai-synonym, I think we can add a Thai synonym corpus to PyThaiNLP, with search by word that outputs the matching data from the corpus. ## Possible implementation -...
I think we should not use any dataset or model whose license is not compatible with open source in our project (PyThaiNLP). If the license is not compatible with open source,...
## Detailed description Hi, are there any plans to add Thai word detokenizers (turning a list of tokens back into a string)? ## Context There are some cases where users need...
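To make the detokenizer request concrete, here is a minimal sketch of what "putting a list of tokens back into a string" could look like. It assumes Thai tokens are concatenated without spaces, while a space is restored only between two adjacent non-Thai alphanumeric tokens. The names `detokenize`, `_is_thai`, and `_needs_space` are hypothetical and are not part of PyThaiNLP's API; real text would need more rules (punctuation, numbers next to Thai, and so on).

```python
import re
from typing import List

# Hypothetical helpers for illustration only; not PyThaiNLP functions.
# The Thai Unicode block spans U+0E00 to U+0E7F.
_THAI_CHAR = re.compile(r"[\u0E00-\u0E7F]")


def _is_thai(token: str) -> bool:
    """Return True if the token contains at least one Thai character."""
    return bool(_THAI_CHAR.search(token))


def _needs_space(prev: str, tok: str) -> bool:
    """Assumed rule: space only between two non-Thai alphanumeric tokens."""
    if _is_thai(prev) or _is_thai(tok):
        return False
    return prev[-1].isalnum() and tok[0].isalnum()


def detokenize(tokens: List[str]) -> str:
    """Join tokens back into a string using the assumed spacing rule."""
    parts: List[str] = []
    prev = None
    for tok in tokens:
        if not tok:  # skip empty tokens
            continue
        if prev is not None and _needs_space(prev, tok):
            parts.append(" ")
        parts.append(tok)
        prev = tok
    return "".join(parts)


print(detokenize(["ผม", "รัก", "แมว"]))
print(detokenize(["I", "like", "dogs"]))
```

Under this rule, Thai-only input is simply concatenated, and English-only input gets its word spaces back; mixed-script spacing is a design choice that an actual implementation would have to settle.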
Hello! I am working on training a new machine translation model for Thai-English and English-Thai. It may not be done by the v5.0.0 deadline, but I hope the new model will be included in the next...
### Description "You are using a model of type camembert to instantiate a model of type roberta. This is not supported for all configurations of models and can yield errors."...
### Description I am trying to run the code below: `from pythainlp.translate.en_th import EnThTranslator, ThEnTranslator`, `enth = EnThTranslator()`, `enth.translate("I Like dogs")`. It works fine on Google Colab but it's only...
A couple of days ago, I saw that we also have **chat/generate** features that use **wangchanglm** as the current LLM for text generation. Moreover, there is an...
**Schedule** - First Beta release: WIP - Production release: WIP See 5.1 Milestone. ## What is new? - Add TUD postag #916 - Add postag of Thai Discourse Treebank #910...
newmm uses a maximum matching algorithm, constrained by Thai Character Cluster (TCC) boundaries with improved TCC rules. It can hit a bug with ambiguous breaking points that makes it slow or very slow. The...
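For readers unfamiliar with dictionary-based matching, the following is a minimal sketch of greedy forward longest matching, a simplified relative of maximum matching. It is not the newmm implementation: it ignores TCC boundaries entirely, and the function name `longest_match_tokenize` and the toy dictionaries are assumptions made for illustration.

```python
from typing import Iterable, List


def longest_match_tokenize(text: str, dictionary: Iterable[str]) -> List[str]:
    """Greedy forward longest matching against a word list.

    Simplified illustration only: no TCC constraint, no handling of
    ambiguous segmentations, and unknown characters become single tokens.
    """
    words = set(dictionary)
    max_len = max((len(w) for w in words), default=1)
    tokens: List[str] = []
    i = 0
    while i < len(text):
        match = None
        # Try the longest candidate first, then shrink it.
        for j in range(min(len(text), i + max_len), i, -1):
            if text[i:j] in words:
                match = text[i:j]
                break
        if match is None:
            match = text[i]  # unknown character: emit it as its own token
        tokens.append(match)
        i += len(match)
    return tokens


print(longest_match_tokenize("ผมรักแมว", ["ผม", "รัก", "แมว"]))
print(longest_match_tokenize("abcd", ["ab", "abc", "cd", "d"]))
```

The second call shows why breaking points can be ambiguous: both `ab` + `cd` and `abc` + `d` are valid segmentations of `abcd`, and the greedy strategy commits to `abc` + `d`. A segmenter that explores every such alternative may do much more work on inputs with many ambiguous points.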
From [Thai-NNER](https://github.com/vistec-AI/Thai-NNER/): the dataset has 4,894 docs and is licensed under CC-BY-SA 3.0. [dev.txt](https://github.com/PyThaiNLP/pythainlp/files/13638655/dev.txt) [train.txt](https://github.com/PyThaiNLP/pythainlp/files/13638653/train.txt)