clipper icon indicating copy to clipboard operation
clipper copied to clipboard

Does Clipper support multiple workers?

Open aksnzhy opened this issue 6 years ago • 2 comments

I have 4 GPUs on my PC and I want to use them to serve one model in a load-balanced way. Does Clipper support that?

aksnzhy avatar Jun 18 '18 21:06 aksnzhy

Yes it does. You can create multiple replicas of a model and Clipper will automatically load-balance requests between them. However, Clipper does not yet support deploying models on GPUs automatically. You can instead do this manually by starting the Docker containers yourself using nvidia-docker after registering your model. You can look at the function that Clipper calls to start containers to see which labels and environment variables are normally set.

dcrankshaw avatar Jun 19 '18 16:06 dcrankshaw

I may be wrong, but instead of starting the docker containers yourself manually, you can simply make the nvidia-docker runtime the default docker runtime

I haven't tested it, but presumably this would make every docker container started on your system have GPU access.

lgendrot avatar Jul 01 '18 21:07 lgendrot