elasticsearch semantic_text - document ingestion inference

semantic_text - document ingestion inference

Open carlosdelest opened this issue 9 months ago • 1 comments

Adds inference calculation to ingested documents, viaShardBulkInferenceActionFilter.

This action filter intercepts TransportShardBulkActions (before being sent to the data nodes) and calculates inference using the InferenceServiceRegistry / ModelRegistry. Batching is done at the BulkRequest level, so there will be a single inference request for each bulk request (maximum of 512 per request).

There's some nuances with copy_to support, as every source of a copy_to that targets a semantic_text must be inferred as well.

Apr 30 '24 16:04 carlosdelest

Pinging @elastic/es-search (Team:Search)

May 03 '24 19:05 elasticsearchmachine

Thanks @Mikep86 and @benwtrent for your reviews!

May 09 '24 17:05 carlosdelest

@elasticmachine update branch

May 13 '24 08:05 carlosdelest

Follow-up work to be done in https://github.com/elastic/elasticsearch/pull/108549

May 13 '24 09:05 carlosdelest

elasticsearch elasticsearch copied to clipboard

semantic_text - document ingestion inference

elasticsearch
elasticsearch copied to clipboard