Michael McCandless
Michael McCandless
> I'm leaning towards a [simple GitHub pages site](https://docs.github.com/en/pages) (thank you @msokolov for the idea) I enabled pages for the [`luceneutil` repo](https://github.com/mikemccand/luceneutil) and pushed a copy of the current nightly...
A nice side effect of this is that the long running (13+ years now!) nightly reports will be backed up via git/GitHub and no longer single sourced on my home...
Phew, OK, I think nightly benchy is now successfully publishing automatically to https://mikemccand.github.io/lucenenightly (using GitHub pages). Last night's run "just worked". I had to create a new public repo (https://github.com/mikemccand/lucenenightly)...
> > FYI: I clicked on a few random links and found a 404 https://mikemccand.github.io/luceneutil/analyzers.html although this page does seem to exist on the current site > > Hmm I'll...
I also aliased (CNAMEd) [benchmarks.mikemccandless.com](https://benchmarks.mikemccandless.com/) -- GitHub pages makes this simple-ish, yay.
I've also put these two files on `https://githubsearch.mikemccandless.com`: ``` enwiki-20120502-lines-1k-fixed-utf8-with-random-label.txt.lzma 5.9G wikimedium500.tasks 289K enwiki-20110115-lines-1k-fixed-with-random-labels.bin 25G ``` I will try to find a better permanent ish home for these sources ...
Oooh we have an official S3 bucket to use now? I had already uploaded the benchy corpus files to my own S3 bucket ... I think the URLs are in...
> [@mikemccand](https://github.com/mikemccand) would you be able to expose the files [@dsmiley](https://github.com/dsmiley) rescued on your server? oh, hmm, not I haven't yet. sorry, catching up. eventually consistent Mike!
> I guess thius 94GB comes from `33M x 768 x 4` bytes? Frankly I never test with indexes > ~2M docs, but maybe there is a call for the...
This is a neat idea -- it would allow the user to accept some "lossy compression" when they know/expect that loss will be minor for their use case. Sort of...