seldon-core icon indicating copy to clipboard operation
seldon-core copied to clipboard

multi model graphs request propagation

Open saeid93 opened this issue 3 years ago • 1 comments

Follow up to Slack channel question with @adriangonz . As part of a project, I have a set of interconnected nodes (each node is a Triton server) in which each of them has a number of models (with the same goal) and I want each request to pass through a different set of models. For example, I want request 1 to pass through model 1 in node 1 -> model 1 in node 2 -> model 1 in node 3, and another path R2. The path for each request is determined prior to run the request and can be possibly sent through metadata to the Triton servers and Seldon core graph.

181790732-dcbab48e-2658-426c-a409-a7fdf5a119e9 (2)

P.S A possible use case is that I have requests with different latency-accuracy requirements and I want to choose the path of the request based on that.

saeid93 avatar Aug 05 '22 15:08 saeid93

In general you could achieve this by

  1. Extending your graph to have a Router before each item. Although this is a challenge with Triton as that is using the V2 protocol.
  2. Utilize Triton's business logic scripting to call the models as needed

In general we will be investigating these use cases in v2 of our APIs.

ukclivecox avatar Aug 15 '22 06:08 ukclivecox

Please test in v2

ukclivecox avatar Dec 05 '22 11:12 ukclivecox