Wannaphong Phatthiyaphaibun
Wannaphong Phatthiyaphaibun
I was the step foe test the model.
iapp_wiki_qa_squad is an extractive question answering dataset for Thai. It's use MIT License. - added notebook that loads the CMU Wiki QA dataset then cleans / filters it and saves...
Add chat.json for Thai from #1915
From #1903, pre-commit is error by jupyter notebook. I was export the notebook from Google colab then add to my pull request but pre-commit was error with the jupyter notebook...
`iapp_wiki_qa_squad` is an extractive question answering dataset for Thai. I was fork the dataset for Open Assistant. It's use MIT License. - added notebook that loads the CMU Wiki QA...
`tlcv2.0_oa` is a dataset that made for Open Assistant Text-only format. It was build from Thai Literature Corpora (TLC). Thai Literature Corpora (TLC) is Corpora of machine-ingestible Thai classical literature...
Hello! I am working train new Machine Translation model for Thai-English and English-Thai. It's may doesn't done in v5.0.0 deadline but I hope new model will include in the next...
**Schedule** - First Beta release: WIP - Production release: WIP See 5.1 Milestone. ## What is new? - Add TUD postag #916 - Add postag of Thai Discourse Treebank #910...
newmm is use the maximum matching algorithms, constrained by Thai Character Cluster (TCC) boundaries with improved TCC rules. It can found a ambiguous breaking points bug that slower/very slow. The...
From [Thai-NNER](https://github.com/vistec-AI/Thai-NNER/), The dataset has 4,894 docs. The dataset is licensed under CC-BY-SA 3.0. [dev.txt](https://github.com/PyThaiNLP/pythainlp/files/13638655/dev.txt) [train.txt](https://github.com/PyThaiNLP/pythainlp/files/13638653/train.txt)