server Allow triton to read from multiple cloud model repositories

Is your feature request related to a problem? Please describe. A clear and concise description of what the problem is. Ex. I'm always frustrated when [...] Trying to use Triton with ClearML-serving. Currently, there are means (with some coding) to copy the models from ClearML to Triton. However, I am looking for a means to differentiate manually deployed models vs those from ClearML. E.g. From two S3 buckets or a hybrid of local folder + S3 bucket.

Describe the solution you'd like A clear and concise description of what you want to happen. Allow Triton to take in multiple repos.

Describe alternatives you've considered A clear and concise description of any alternative solutions or features you've considered. Nil.

Additional context Add any other context or screenshots about the feature request here.

Nov 21 '21 15:11 okyspace

Triton already supports such feature which can be done by specifying --model-repository option multiple times to included models from multiple repositories (Model Repository Doc)

Nov 22 '21 21:11 GuanLuo

@GuanLuo we do not yet support multiple cloud repositories for the same Triton server instance. This ticket should be left open

Nov 23 '21 00:11 CoderHam

@CoderHam will proper error be returned currently if user specifies multiple cloud location?

Nov 23 '21 00:11 GuanLuo

@CoderHam will proper error be returned currently if user specifies multiple cloud location?

Good question, I don't think we have a check for that and I don't think an appropriate error will be returned.

Nov 23 '21 00:11 CoderHam

@CoderHam do you mean triton cannot read from e.g. both aws s3 and google cloud?

Just to share, I tested with 2 x S3 buckets and 1 x local folder. Triton loaded all the models (in the 3 model repo folders) successfully. But if there are repeated model in the different model folder, there will be error.

Nov 23 '21 03:11 okyspace

hi, can i check if there is any update on this?

Jan 30 '24 02:01 okyspace

I think you could use model namespacing to avoid collisions when using multiple model directories, including with cloud providers. See this test to see an example of how to enable model namespacing: https://github.com/triton-inference-server/server/blob/e787476ea3fefec5febfe7e382a9c2c6630c124b/qa/L0_model_namespacing/test.sh

May 15 '24 19:05 dyastremsky

server server copied to clipboard

Allow triton to read from multiple cloud model repositories

server
server copied to clipboard