Adding support for Thai text segmentation

Open mattleff opened this issue 1 year ago • 1 comments

First off, thanks for a great library! 🙏🏻

I want to use Pagefind in a project that contains Thai text, and I'm interested in adding Thai support to Pagefind. I see that Charabia has a Thai segmenter (https://github.com/meilisearch/charabia/pull/114). Can you give me some pointers on what I'd need to do to enable Thai segmentation in Pagefind?

mattleff • Oct 16 '24 08:10

Ah, good note! (Sorry for the delay).

We'll need to enable the thai feature for Charabia, and then configure Thai as an extended language in Pagefind.

I can make sure to include this in the next release!

bglw • Nov 20 '24 19:11

Is there any update on this? I can't find any documentation on extending language support to Thai.

Phon1209 • Apr 17 '25 17:04

👋 This has shipped in Pagefind v1.4.0 🎉

bglw • Sep 01 '25 06:09