phiresky
I've managed to make tantivy work fully in the browser with databases of arbitrary size (tested with a 14GB database and it works great). The code and demo are here:...
> With Tantivy's use of mmap for storing the index, what does memory usage look like to the end user on your proof of concept? In my POC I replace...
Another note: The same thing could be used with an IndexedDB backend to e.g. fix the Matrix search in the browser: https://github.com/matrix-org/seshat/issues/84 So far I've only looked at read-only but...
> It's not clear that IPFS supports byte range requests apart from the gateway-browser connection.

> may have to be pre-chunked

In my [above-linked article](https://phiresky.github.io/blog/2021/hosting-sqlite-databases-on-github-pages/) I actually do split the...
> For obvious reason, this is unmergeable but I assume this was not the purpose of this PR?

Actually, it would be great to get as much as possible of...
> Did you find a way to parallelize the requests?

Most of the requests are sequential and synchronous, I didn't change anything there, except that it optimistically prefetches more data...
I don't think it actually waits for the end of the input file, it just reads the input in chunks. So `tail -f x.csv | xsv...` works, it just outputs...
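To illustrate the chunked-reading behaviour described above, here is a minimal sketch of a streaming filter. It is not xsv's code: `first_column` is a hypothetical helper, and it splits on commas naively instead of doing real CSV parsing. It only shows why a tool that consumes its input incrementally can produce output while the input is still growing.

```rust
use std::io::{self, BufRead, Write};

/// Hypothetical helper: writes the first comma-separated field of each input
/// line to `output`, one record at a time. It never needs to see the end of
/// the input before emitting a result.
fn first_column<R: BufRead, W: Write>(input: R, mut output: W) -> io::Result<usize> {
    let mut rows = 0;
    for line in input.lines() {
        let line = line?;
        // Naive split; a real CSV parser would also handle quoted fields.
        let field = line.split(',').next().unwrap_or("");
        writeln!(output, "{}", field)?;
        // Flush per record so a downstream reader sees output immediately.
        output.flush()?;
        rows += 1;
    }
    Ok(rows)
}
```

Called from `main` as `first_column(io::stdin().lock(), io::stdout().lock())`, this would print a field for every line as soon as that line arrives on stdin, which is the pattern that lets a `tail -f` pipeline work.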
If I'm understanding correctly, the only use of Seek is to get the total file size for a single `Vec::with_capacity(len)` call in the Xls reader. The capacity is also never read...
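A rough sketch of the pattern in question, assuming the size is used only as an allocation hint. Both function names are hypothetical and this is not the reader's actual code. It shows that dropping the `Seek` bound changes only the initial allocation, not the bytes that are read.

```rust
use std::io::{self, Read, Seek, SeekFrom};

/// Uses Seek only to learn the total length, then pre-allocates the buffer.
fn read_all_with_seek<R: Read + Seek>(mut r: R) -> io::Result<Vec<u8>> {
    let len = r.seek(SeekFrom::End(0))?; // total size in bytes
    r.seek(SeekFrom::Start(0))?; // rewind before reading
    let mut buf = Vec::with_capacity(len as usize);
    r.read_to_end(&mut buf)?;
    Ok(buf)
}

/// Same result with only a Read bound; the Vec grows as data arrives.
fn read_all<R: Read>(mut r: R) -> io::Result<Vec<u8>> {
    let mut buf = Vec::new();
    r.read_to_end(&mut buf)?;
    Ok(buf)
}
```

The second version accepts non-seekable inputs such as pipes, at the cost of a few reallocations while the buffer grows.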
Ok I just tried to remove all references to Seek to see what happens, but it turns out that [zip::read::ZipArchive](https://docs.rs/zip/0.5.4/zip/read/struct.ZipArchive.html) also requires Seek, which is used in basically all readers...
Ok, actually I use `gnome-screenshot -a -f` now, which also works without GNOME, has this behaviour, and doesn't have the rectangle redrawing bug scrot has, so if no one else...