LLM-VM icon indicating copy to clipboard operation
LLM-VM copied to clipboard

Load-balancing / auto-scaling for LLM serving on Azure

Open VictorOdede opened this issue 1 year ago • 5 comments

VictorOdede avatar Oct 31 '23 15:10 VictorOdede

Hi @VictorOdede, I'd like to try this issue. Could you please provide some more information?

internot169 avatar Dec 22 '23 19:12 internot169

I would love to work on this @VictorOdede , can you share details for this issue

kaushikdaiv7 avatar Jan 07 '24 22:01 kaushikdaiv7

Hi! Vik, Can I also please work on this issue? Thank you!

lucylililiwang avatar Feb 13 '24 16:02 lucylililiwang

When we are taking care of the Load-balancing, is it alright for us to do Azure Kubernetes Service (AKS) along with Horizontal Pod Autoscaler (HPA) and Kubernetes Ingress Controller for load balancing? Thank you!

lucylililiwang avatar Feb 13 '24 16:02 lucylililiwang

sorry, when we setting up the Kubernetes cluster, which Kubernetes version should we choose? Thank you! kubernates

lucylililiwang avatar Feb 13 '24 16:02 lucylililiwang