Liam Bigelow
This is now on the current Pagefind alpha release (`v1.4.0-alpha.1`) if you'd like to take it for a spin :) Docs here: https://unreleased.pagefind.app/docs/indexing/#indexing-special-characters Will close out this issue once...
This has shipped in [Pagefind v1.4.0](https://github.com/Pagefind/pagefind/releases/tag/v1.4.0)
@thebrightsideofmathematics Yes, the v1.1.0 release changed much of the ranking algorithm. You can see the release notes for this here: https://github.com/CloudCannon/pagefind/releases/tag/v1.1.0 I wouldn't expect the weighting to be quite...
I'm going to look at the ranking again in an upcoming release, to resolve some of the current issues around lack of control. I'll update here when I do :)
Good idea! I'm overdue to rework the configuration internally to allow richer config files (currently it unifies the config file and the CLI arguments, which is blocking complex config structures)....
Hi @awhitford, this file is vendored in from another package, so updating the source is likely to revert back at some point. It would be better to have this...
Hmm, under the hood [charabia defers to jieba-rs](https://github.com/meilisearch/charabia/blob/ae07a589ab5fa825e7fb68777b76f81e155be1b9/charabia/Cargo.toml#L42) for segmentation in Chinese, so I wouldn't expect any behavior to change with using jieba directly. Is there something I'm missing?
Oh I see, yes Pagefind doesn't use _anything_ for segmentation when searching; charabia is only used when indexing the site using the Pagefind binary, it's not a web dependency....
Thanks! I'll get into reviewing this in the coming few days
This all looks good to me so far! :)