Patrick (Gus) Heck
Patrick (Gus) Heck
The original vision for this product was related to a system that was continuously accepting new data, but some use cases involve batch data loads and so a command line...
threw 33 million 1k docs at a fairly simple ingest and the threads for our steps don't seem to be doing much  There is no thread that seems to...
The current design focused on getting the information in place, but an oversight has left a ticking time bomb. Eventually we will have two artifacts that have a name collision...
CodeCov is apparently getting ratelimited by github, causing errors with the message: `Unable to locate build via Github Actions API. Please upload with the Codecov repository upload token` However putting...
The current fault tolerance achieves it's goal but if it resumes a very large scan it will spend a period of time hashing documents and determining that it has already...
At the moment none of our scanners have the ability to detect if a previously indexed document has disappeared. IIRC the old version of File Scanner that was based on...
As a precursor to #115 we will want to ensure that the current node is optimizing for the rate limiting step. The basic task is to identify steps that are...
Reformatting the license makes it tedious to verify that it actually is a verbatim copy of the ASL 2.0 license. Using the exact format it was published in originally makes...