cortex icon indicating copy to clipboard operation
cortex copied to clipboard

Use CPU instances with GPU inference accelerator

Open vishalbollu opened this issue 5 years ago • 7 comments

Description

Instead of spinning up a GPU nodegroup, spin up a CPU nodegroup with Elastic Inference (GPU accelerated inference).

Additional Context

vishalbollu avatar Nov 28 '19 21:11 vishalbollu

+1. This is critical for a cost-effective deployment.

scribu avatar Dec 07 '19 15:12 scribu

Hi, I'd like to look into this issue if anyone can help me get started.

lezwon avatar Jul 22 '20 16:07 lezwon

@lezwon thanks for your interest!

I think the first step is to figure out how to create an eks cluster with instances that have elastic inference attached. Currently, Cortex uses eksctl to create the cluster, and based on https://github.com/weaveworks/eksctl/issues/643, it looks like eksctl might not support elastic inference yet. But I am not sure if that's the case, or if there is a workaround; it could be worth reaching out to the eksctl team to inquire.

@RobertLucian or @vishalbollu, do you have any additional context on this?

deliahu avatar Jul 22 '20 19:07 deliahu

@deliahu Thank you for the help. I'll look into the issue you mentioned with eksctl. :)

lezwon avatar Jul 23 '20 00:07 lezwon

@lezwon sounds good, thank you, keep us posted!

deliahu avatar Jul 23 '20 18:07 deliahu

This issue has been depriorized and the relevant eksctl issue is closed for inactivity but using EI would be cost saving for most of the Cortex users. Is there any plan to solve this issue in the following releases?

H4dr1en avatar Apr 06 '21 13:04 H4dr1en

@H4dr1en we have added multi instance types clusters as a feature recently. This can mitigate costs already, by allowing to run both CPU / GPU and Spot instances in the same cluster.

I know it is not remotely the same as Elastic Inference, but it is an improvement :)

We will look into Elastic Inference again soon since we are re-focusing the team's efforts on improving the Cortex UX on AWS.

miguelvr avatar Apr 06 '21 13:04 miguelvr