sphinx-immaterial Update to mkdocs-material >= 9

mkdocs-material v9 has been released with the "rich search" functionality, which basically amounts to including a limited set of HTML tags in the search snippets, rather than using just the text content. Since we have a separate search backend we can't just get this functionality for free on the backend side, but it would be interesting to look at how easily we can implement it.

Currently, to extract snippets, the client-side javascript code downloads the full HTML of each candidate result page, splits it into sections, and extracts the text to find matches of the search terms. We could probably modify this to retain some HTML tags when generating snippets, in order to provide a similar "rich search" display as in mkdocs-material.

Alternatively, we could preprocess each document at build time and output a stripped version of the document in jsonp format. That might make the client-side work more efficient (unclear whether it would significantly reduce the amount of data that must be fetched per candidate result).

Jan 09 '23 21:01 jbms

Re: search backend -- Has anyone looked into using lunr.py to actually build a Lunr index? https://github.com/yeraydiazdiaz/lunr.py

Mar 27 '23 19:03 kartben

The thing that I don't especially like about lunr is that the "search index" that must be downloaded by clients contains the entire text content of the website, which I think would be problematic for large sites. In contrast, the Sphinx search index contains only:

document names
document titles
map indicating for each word that is present, which documents contain it
sphinx domain object names and synopses

I expect that to be significantly smaller, though I haven't done any benchmarks.

Mar 27 '23 20:03 jbms

Its been over a year

Well, I just tried to merge updates from upstream, but

the search integration upstream was significantly changed in JS. Again, I'm out of my depth there with JS (especially since its actually rxjs)
the significant refactor of HTML templates and CSS/JS sources.
the addition of the toc folow feature that is already implemented here
Some new features that we'd have to re-implement in sphinx-immaterial python sources

Needless to say, I failed to merge updates from upstream. It would be embarrassing and unproductive to push my local attempt into a branch.

With the lack of regular attention, this project has become a mess for merging updates from upstream. I can do maintenance, but

there's an inherent limited quality of life in maintenance mode
maintenance mode can't live up to what users would expect from a mkdocs-material port to sphinx

Mar 29 '24 01:03 2bndy5

I think I could take care of the merging (especially for search) but do you think you could take care of implementing the new features that require separate sphinx/python integration, like page icons, etc.?

Mar 29 '24 02:03 jbms

The new features being implemented wouldn't need to block the merge but it would be nice not to lose track of them.

Mar 29 '24 02:03 jbms

Python is my abode. You need something done in python (or CSS)? I can (& will) help out there. Its just the JS part of this project that I can't do on my own. And there is a bunch of tweaking to the build script upstream.

Mar 29 '24 03:03 2bndy5

The new features being implemented wouldn't need to block the merge but it would be nice not to lose track of them.

My thinking exactly. My instinct tells me to start a repo "project" (kinda like github scrum) to group issues that would track the new features. I guess, since I can't control that here, we could just use issue labels instead though.

Mar 29 '24 03:03 2bndy5

sphinx-immaterial sphinx-immaterial copied to clipboard

Update to mkdocs-material >= 9

Its been over a year

sphinx-immaterial
sphinx-immaterial copied to clipboard