seldon-core
Using shared models/modules across inference pipelines
Academic systems like Rim and grandslam can share a single model across multiple inference pipelines. Since there are cases where one model is used in two separate pipelines submitted by different users within a company, it would be better to reuse the already deployed model and autoscale it based on demand rather than run two separate copies of it. However, since this requires a custom scheduling policy on top of the Kubernetes scheduler, it is not an urgent change.
One implementation idea for the user interface would be to first deploy models as separate services and then connect them through the YAML file. This requires decoupling the physical placement of models from the graph representation.
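As a rough sketch of what such a YAML interface could look like, the fragment below declares one model as a standalone resource and then references it from two independent pipelines. The resource kinds, field names, and model/pipeline names here are illustrative assumptions, not an existing Seldon API:

```yaml
# Hypothetical sketch: a shared model deployed once as its own resource.
# "SharedModel"/"Pipeline" kinds and all names below are assumptions for illustration.
apiVersion: example.io/v1alpha1
kind: SharedModel
metadata:
  name: sentiment-classifier     # deployed and autoscaled independently
spec:
  uri: gs://models/sentiment/1   # illustrative model artifact location
---
# Two pipelines, possibly owned by different users, each referencing the
# shared model by name instead of embedding their own copy of it.
apiVersion: example.io/v1alpha1
kind: Pipeline
metadata:
  name: support-ticket-triage
spec:
  steps:
    - modelRef: sentiment-classifier
---
apiVersion: example.io/v1alpha1
kind: Pipeline
metadata:
  name: product-review-analysis
spec:
  steps:
    - modelRef: sentiment-classifier
```

Because pipelines only hold a reference, the scheduler stays free to place and scale the model independently of any one pipeline's graph, which is exactly the decoupling described above.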
We will be addressing this in v2 of our APIs.