Results: 2 comments of Justin
> [@justinSmileDate](https://github.com/justinSmileDate) Thanks for the interest. By default, we rely on the k8s Service to distribute the inference traffic. Additionally, we do have a more complicated design of and...
> Depending on the vendor, out of the box it uses k8s round robin; in our runtimes we use sgl router for better load balancing as well as for pd...
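
For readers unfamiliar with the term, the round-robin distribution mentioned in the comments can be sketched roughly as follows. This is only an illustration of the general idea, not the project's or Kubernetes' actual implementation; the pod addresses and the `round_robin` helper are hypothetical.

```python
from itertools import cycle


def round_robin(endpoints):
    """Return an iterator that hands out endpoints in a fixed rotating order."""
    return cycle(endpoints)


# Hypothetical inference pod addresses (assumed for illustration only).
pods = ["10.0.0.1:8000", "10.0.0.2:8000", "10.0.0.3:8000"]

picker = round_robin(pods)

# Each incoming request takes the next pod in the rotation,
# regardless of how busy that pod currently is.
assignments = [next(picker) for _ in range(5)]
```

Because the rotation ignores how long each request runs, inference traffic with very uneven request sizes can leave some pods busier than others, which is presumably the motivation for the dedicated router mentioned in the second comment.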