pythainlp icon indicating copy to clipboard operation
pythainlp copied to clipboard

Thai Natural Language Processing in Python.

Results 71 pythainlp issues
Sort by recently updated
recently updated
newest added
trafficstars

## Detailed description From https://github.com/PyThaiNLP/thai-synonym, I think we can add Thai synonym corpus to PyThaiNLP with search by word and output the data from the corpus. ## Possible implementation -...

I think we should doesn't use any dataset or model that does not compatible with Open Source in our project (PyThaiNLP). If the license that doesn't compatible with open source,...

## Detailed description Hi, are there any plans to add Thai word detokenizers (put list of tokens back to string)? ## Context There are some cases where the users need...

Hello! I am working train new Machine Translation model for Thai-English and English-Thai. It's may doesn't done in v5.0.0 deadline but I hope new model will include in the next...

refactoring

### Description "You are using a model of type camembert to instantiate a model of type roberta. This is not supported for all configurations of models and can yield errors."...

bug

### Description I am trying to run the below code: from pythainlp.translate.en_th import EnThTranslator, ThEnTranslator enth = EnThTranslator() enth.translate("I Like dogs") It works fine on Google coolabs but it's only...

bug

In a couple days before, I've seen that we also have a **chat/generate** features that utilize **wangchanglm** as a current LLM model for text generation ability. Moreover, there has an...

enhancement
Hacktoberfest

**Schedule** - First Beta release: WIP - Production release: WIP See 5.1 Milestone. ## What is new? - Add TUD postag #916 - Add postag of Thai Discourse Treebank #910...

documentation

newmm is use the maximum matching algorithms, constrained by Thai Character Cluster (TCC) boundaries with improved TCC rules. It can found a ambiguous breaking points bug that slower/very slow. The...

bug

From [Thai-NNER](https://github.com/vistec-AI/Thai-NNER/), The dataset has 4,894 docs. The dataset is licensed under CC-BY-SA 3.0. [dev.txt](https://github.com/PyThaiNLP/pythainlp/files/13638655/dev.txt) [train.txt](https://github.com/PyThaiNLP/pythainlp/files/13638653/train.txt)

enhancement
Hacktoberfest