ray_vllm_inference How can we do the same, using KubeRay, RayCluster and RayServe?

How can we do the same, using KubeRay, RayCluster and RayServe?

Open WinsonSou opened this issue 1 year ago • 2 comments

So i've got a kubernetes cluster, installed with KubeRay Operator and created a RayCluster, but how do i then create a manifest for Rayserve to serve, say a Llama2 or Mistral 7b? appreciate your help and thank you in advance!

Jan 05 '24 09:01 WinsonSou

Sorry but I'm not working with Kubernetes. Please ask at the Anyscale #Kubernetes Slack channel and ask the question. There is a link to Slack on there website.

Jan 05 '24 15:01 asprenger

So i've got a kubernetes cluster, installed with KubeRay Operator and created a RayCluster, but how do i then create a manifest for Rayserve to serve, say a Llama2 or Mistral 7b? appreciate your help and thank you in advance!

I see he set it up as the same as instructed in the document of Anyscale. You can follow this https://docs.ray.io/en/latest/cluster/getting-started.html to try his project. I am trying to set up his project again with this

Apr 23 '24 04:04 TrieuLe0801

ray_vllm_inference ray_vllm_inference copied to clipboard

How can we do the same, using KubeRay, RayCluster and RayServe?

ray_vllm_inference
ray_vllm_inference copied to clipboard