skb888
Hi, can anyone help update or share the latest doc for deploying an LLM model across multiple nodes? Thanks a lot.
Thanks for sharing. I have tested it and hit two issues. 1. The Hugging Face client (latest huggingface_hub) no longer accepts use_auth_token; you may need to replace use_auth_token= with token=. 2. Also,...
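If the download script has to work across huggingface_hub versions, the rename can be handled by inspecting the function signature instead of hard-coding either name. A minimal sketch; the snapshot_download below is a dummy stand-in so the snippet is self-contained, not the real huggingface_hub function:

```python
import inspect

def auth_kwargs(fn, hf_token):
    """Pass the token under whichever parameter name this
    huggingface_hub version accepts: newer releases use "token",
    older ones used the now-removed "use_auth_token"."""
    params = inspect.signature(fn).parameters
    key = "token" if "token" in params else "use_auth_token"
    return {key: hf_token}

# Dummy stand-in for huggingface_hub.snapshot_download, only to
# keep the sketch runnable without the library installed:
def snapshot_download(repo_id, token=None):
    return repo_id, token

kwargs = auth_kwargs(snapshot_download, "hf_xxx")
print(kwargs)  # picks "token" because the stand-in accepts it
snapshot_download("some-org/some-model", **kwargs)
```

With a real huggingface_hub import, the same `auth_kwargs(snapshot_download, ...)` call would select the right keyword on both old and new versions.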
Another issue came up when I moved to the next step. In step 3, Create a ServingRuntime, I changed pipelineParallelSize: 1 to pipelineParallelSize: 2 in https://github.com/kserve/kserve/blob/master/config/runtimes/kserve-huggingfaceserver-multinode.yaml. Otherwise, I will...
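For reference, the change lands in the workerSpec section of that runtime. A sketch of the relevant fragment, assuming the v1alpha1 ClusterServingRuntime layout from the linked file (containers and other fields omitted, and field placement may differ slightly across KServe versions):

```yaml
apiVersion: serving.kserve.io/v1alpha1
kind: ClusterServingRuntime
metadata:
  name: kserve-huggingfaceserver-multinode
spec:
  workerSpec:
    pipelineParallelSize: 2   # changed from 1: one head pod plus one worker pod
    tensorParallelSize: 1
```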
Thanks for the response. It happens in step 2, Download the Model to the PVC. I changed the memory from "1Gi" to "10Gi" so that I could run python...
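The memory bump goes into the resources section of the download pod's container. An illustrative fragment; the container name and image here are placeholders, not copied from the doc:

```yaml
# Fragment of the model-download pod spec (illustrative names):
spec:
  containers:
    - name: model-download        # placeholder name
      image: python:3.11          # placeholder image
      resources:
        requests:
          memory: "10Gi"          # raised from "1Gi" for the download script
        limits:
          memory: "10Gi"
```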
Here are the detailed errors I see when I run:

kubectl apply -f kserve-huggingfaceserver-multinode.yaml

Error from server (Forbidden): error when applying patch: {"metadata":{"annotations":{"kubectl.kubernetes.io/last-applied-configuration":"{\"apiVersion\":\"serving.kserve.io/v1alpha1\",\"kind\":\"ClusterServingRuntime\",\"metadata\":{\"annotations\":{},\"name\":\"kserve-huggingfaceserver-multinode\"},\"spec\":{\"annotations\":{\"prometheus.kserve.io/path\":\"/metrics\",\"prometheus.kserve.io/port\":\"8080\"},\"containers\":[{\"args\":[\"--model_name={{.Name}}\"],\"command\":[\"bash\",\"-c\",\"export MODEL_DIR_ARG=\\\"\\\"\\nif [[ ! -z ${MODEL_ID} ]]\\nthen\\n...
After I switched pipelineParallelSize to 2, I ran step 4, Deploy the model, on KServe v0.15.0. Here is the example yaml I referred to: https://kserve.github.io/archive/0.15/modelserving/v1beta1/llm/huggingface/multi-node/#3-create-a-servingruntime I have added...
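For step 4, the InferenceService needs a workerSpec that matches the runtime's parallelism. A rough sketch; the metadata name and storageUri are placeholders, so check the linked doc for the exact spec:

```yaml
apiVersion: serving.kserve.io/v1beta1
kind: InferenceService
metadata:
  name: huggingface-llm           # placeholder name
spec:
  predictor:
    model:
      modelFormat:
        name: huggingface
      storageUri: pvc://model-pvc/model   # placeholder PVC path from step 2
    workerSpec:
      pipelineParallelSize: 2     # must match the ServingRuntime setting
      tensorParallelSize: 1
```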
Thanks for the suggestion. I tried creating model_list.yaml and then ran the command below, but it still does not work. Meanwhile, I have tried the qwen3:4b model and hit the same issues. holmes ask...
Hi, I think deepseek-r1:8b does not support function calls. I have tested qwen3:4b and llama3.2:3b, which do support function calls. I have tried both the OpenAI-compatible gateway (--model="openai/") and the original approach (--model="ollama/")....
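One way to check whether a given Ollama model accepts tool definitions is to POST a request containing a tools field to the OpenAI-compatible endpoint and look for tool_calls in the reply. A sketch of the request body; the list_pods tool is hypothetical, and the actual HTTP call is left as a comment so the snippet stays self-contained:

```python
import json

# Request body for Ollama's OpenAI-compatible chat endpoint
# ($OPENAI_API_BASE/chat/completions, e.g. http://127.0.0.1:11434/v1).
# Models without tool support (deepseek-r1:8b, per this thread)
# ignore or reject the "tools" field.
payload = {
    "model": "llama3.2:3b",
    "messages": [{"role": "user", "content": "Which pods are failing?"}],
    "tools": [{
        "type": "function",
        "function": {
            "name": "list_pods",  # hypothetical tool for illustration
            "description": "List pods in a namespace",
            "parameters": {
                "type": "object",
                "properties": {"namespace": {"type": "string"}},
                "required": ["namespace"],
            },
        },
    }],
}
print(json.dumps(payload, indent=2))
# POST this body to $OPENAI_API_BASE/chat/completions (curl or
# urllib.request) and check whether the response message contains
# a "tool_calls" entry.
```

If the response message comes back with tool_calls populated, the model handled the function definition; a plain text answer or an error suggests it did not.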
Thanks for the quick response. I have double-checked that OPENAI_API_BASE is configured correctly:

$ echo $OPENAI_API_BASE
http://127.0.0.1:11434/v1

$ ollama list
NAME         ID            SIZE    MODIFIED
llama3.2:3b  a80c4f17acd5  2.0 GB  29 hours...
Thanks, it works for me now.