jesterj
Document Ingestion Framework for Search Systems
Problem is I have no artistic talent (see below). If someone would like to contribute something based on the image below that didn't make people's eyes bleed quite so badly...
This is a placeholder/parent ticket for the key feature of 2.0 when we get there. This will generally include: - [ ] Cluster formation so that nodes can access a...
Need to investigate and fix: Manifest.writeTo(Writer) has been deprecated and is scheduled to be removed in Gradle 4.0. Please use Manifest.writeTo(Object) instead.
Would like to have a built-in sender for this, but I've had to remove it because I can't afford the time to chase down the stuff they keep breaking...
A more performance-oriented alternative to FieldTemplateProcessor that can join an arbitrary number of fields, possibly with a delimiter.
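As a rough illustration of the idea, the sketch below joins the values of several named fields with a delimiter using a single StringBuilder and no template engine. The class name `FieldJoiner`, the `join` method, and the `Map<String, List<String>>` stand-in for a document are all hypothetical and are not JesterJ's actual API.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper, not part of JesterJ: joins field values with a delimiter.
final class FieldJoiner {

    // Concatenates every value of each named field, in the order given,
    // skipping fields that are absent from the document.
    static String join(Map<String, List<String>> doc, List<String> fieldNames, String delimiter) {
        StringBuilder sb = new StringBuilder();
        boolean first = true;
        for (String name : fieldNames) {
            List<String> values = doc.get(name);
            if (values == null) {
                continue; // missing field: nothing to append
            }
            for (String value : values) {
                if (!first) {
                    sb.append(delimiter);
                }
                sb.append(value);
                first = false;
            }
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        // The document is modeled as field name -> list of values (an assumption).
        Map<String, List<String>> doc = new HashMap<>();
        List<String> first = new ArrayList<>();
        first.add("Ada");
        List<String> last = new ArrayList<>();
        last.add("Lovelace");
        doc.put("first_name", first);
        doc.put("last_name", last);

        List<String> names = new ArrayList<>();
        names.add("first_name");
        names.add("last_name");
        System.out.println(join(doc, names, " "));
    }
}
```

Passing an empty string as the delimiter gives plain concatenation, which covers the "possibly with a delimiter" case without a separate code path.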
Opening this ticket as a place to stash thoughts related to balancing load across nodes (once we have the ability to start multiple nodes that talk to each other!). Of...
Once we have a document in the pipeline, we need to be able to adjust the names of fields by regex match and replace. For example, s/^(file_size)$/$1_i/ to allow dynamic field...
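The substitution s/^(file_size)$/$1_i/ can be sketched in Java with `java.util.regex`, where `$1` in the replacement string refers to the first capture group. The class `FieldRenamer`, its `renameFields` method, and the plain `Map` used as a document are hypothetical names for illustration and do not reflect how JesterJ implements this.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.regex.Pattern;

// Hypothetical helper, not part of JesterJ: renames fields by regex replace.
final class FieldRenamer {

    // Returns a new map whose keys are the original field names with the
    // pattern replaced; names that do not match are carried over unchanged.
    static Map<String, String> renameFields(Map<String, String> fields,
                                            Pattern pattern,
                                            String replacement) {
        Map<String, String> renamed = new LinkedHashMap<>();
        for (Map.Entry<String, String> entry : fields.entrySet()) {
            String newName = pattern.matcher(entry.getKey()).replaceAll(replacement);
            renamed.put(newName, entry.getValue());
        }
        return renamed;
    }

    public static void main(String[] args) {
        Map<String, String> fields = new LinkedHashMap<>();
        fields.put("file_size", "1024");
        fields.put("title", "report");

        // Equivalent of s/^(file_size)$/$1_i/
        Pattern pattern = Pattern.compile("^(file_size)$");
        System.out.println(renameFields(fields, pattern, "$1_i"));
    }
}
```

This sketch does not handle the case where two source names map to the same target name; the later entry simply overwrites the earlier one.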
Though cloud is the primary mode for Solr these days, stand-alone non-cloud installs still exist. We should have a processor that supports this scenario, similar to (possibly factored from) SendToSolrCloudProcessor....
This ticket would provide a way to place the results of processing on a Kafka topic. Noting that placing large documents in a Kafka topic is an anti-pattern, the recommended...
This ticket would add a scanner implementation that reads documents from a Kafka topic as a consumer. When documents are large it would be expected that the item read is...