pythainlp
pythainlp copied to clipboard
Thai Natural Language Processing in Python.
Bumps [transformers](https://github.com/huggingface/transformers) from 4.36.0 to 4.38.0. Release notes Sourced from transformers's releases. v4.38: Gemma, Depth Anything, Stable LM; Static Cache, HF Quantizer, AQLM New model additions 💎 Gemma 💎 Gemma...
Add save and load model for `pythainlp.classify.param_free.GzipModel`
## Detailed description MCU ระดับเล็กๆ มักใช้ภาษา c/c++ อยากให้มี ThaiNLP ภาษา c ด้วยจะดีมากๆครับ
Chulalongkorn University release Thai Universal Dependency Treebank, consisting of 3,627 trees annotated in accordance with the Universal Dependencies (UD) framework. arXiv: https://arxiv.org/abs/2405.07586 GitHub (dataset and model): https://github.com/nlp-chula/TUD
I think `sent_tokenize` should support input function from `List[str]` type. Now, sent_tokenize support `str` type only.
From https://github.com/PyThaiNLP/pythainlp-corpus/issues/26, I think `pythainlp.transliterate.romanize` should split whitespace before romanize the text. It can improve `romanize` funtion.
### What does this changes sent_tokenizer function now also supports list of string ### What was wrong Before the changes, The sent_tokenizer function was taking string as parameter only. ###...
Hello, thank you for the library. I've written a free program for learning languages called Lute (https://github.com/LuteOrg/lute-v3), and it would be nice to add Thai support. This library looks great,...
PyThaiNLP join Hacktoberfest 2024!!! You can contribute to PyThaiNLP and get the Hacktoberfest 2024 reward. If you don't found the topic that you want to do, you can create own...
### Description Traceback (most recent call last): File "/mnt/c/Users/gpr_y/Documents/_z__py4nux/nlp/thainlp/tone_quant.py", line 32, in print(thai_word_tone_detector('อีก')) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/home/geebee/anaconda3/lib/python3.11/site-packages/pythainlp/util/thai.py", line 200, in thai_word_tone_detector return [(i, tone_detector(i.replace("หฺ", "ห"))) for i in _pronunciate] ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File...