gujing comments

Results: 6 comments by gujing

cloud provider v2.2.0 doesn't work for Kubernetes 1.22.x

This looks like a bug. @richard2006, please fix it. Thanks.

Error during operation

In `telemanom/errors.py`, change `E_seq = np.delete(E_seq, i_to_remove, axis=0)` to `E_seq = np.delete(E_seq, i_to_remove.astype(np.int), axis=0)`.
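A minimal sketch of the fix above, assuming the error comes from `i_to_remove` having a non-integer dtype (for example, an empty array, which NumPy creates as `float64` by default). The helper name `delete_rows` is hypothetical and not part of telemanom. It uses the built-in `int` rather than `np.int`, because the `np.int` alias was deprecated in NumPy 1.20 and removed in NumPy 1.24; the two are equivalent where `np.int` still exists.

```python
import numpy as np


def delete_rows(E_seq, i_to_remove):
    """Delete rows of E_seq at the given indices (hypothetical helper).

    The indices are cast to an integer dtype first, so that a float-typed
    index array (including an empty one) is accepted by np.delete.
    """
    idx = np.asarray(i_to_remove).astype(int)  # same effect as astype(np.int) on older NumPy
    return np.delete(E_seq, idx, axis=0)


if __name__ == "__main__":
    E_seq = np.arange(10).reshape(5, 2)
    # Float-typed indices, as might come out of earlier array arithmetic.
    print(delete_rows(E_seq, np.array([1.0, 3.0])))
    # An empty index array has dtype float64 by default; nothing is removed.
    print(delete_rows(E_seq, np.array([])).shape)
```

On NumPy 1.24 or later, the one-line change in `telemanom/errors.py` would presumably need `astype(int)` in place of `astype(np.int)`.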

[BUG]:From server.py: ValueError: The following `model_kwargs` are not used by the model: ['token_type_ids']

same problem

[BUG]:From server.py: ValueError: The following `model_kwargs` are not used by the model: ['token_type_ids']

> > Can you try downgrading `transformers` to 4.21.0?
> >
> > I tried, and another bug happened: Traceback (most recent call last): File "/home/ubuntu/./ColossalAI/applications/Chat/inference/server.py", line 10, in <module> from llama_gptq import load_quant...

NETWORK ERROR DUE TO HIGH TRAFFIC. PLEASE REGENERATE OR REFRESH THIS PAGE. (error_code: 4)

For Linux:

```bash
# server
nohup python3 -m fastchat.serve.controller >> /root/server.log 2>&1 &
while [ `grep -c "Uvicorn running on" /root/server.log` -eq '0' ]; do
    sleep 1s
    echo "wait server running"...
```
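The wait loop in the snippet above polls the log file until the line `Uvicorn running on` appears. The same idea can be sketched in Python. This is only an illustration: the function name `wait_for_log_marker` and the timeout and interval defaults are assumptions, not part of FastChat.

```python
import time
from pathlib import Path


def wait_for_log_marker(log_path, marker, timeout=60.0, interval=1.0):
    """Poll a log file until `marker` appears in it.

    Returns True if the marker is found, or False if `timeout` seconds
    pass first. Hypothetical helper, not a FastChat API.
    """
    path = Path(log_path)
    deadline = time.monotonic() + timeout
    while True:
        # The file may not exist yet if the server has not started writing.
        if path.exists() and marker in path.read_text(errors="replace"):
            return True
        if time.monotonic() >= deadline:
            return False
        time.sleep(interval)


if __name__ == "__main__":
    # Path and marker are taken from the shell snippet above.
    if wait_for_log_marker("/root/server.log", "Uvicorn running on", timeout=120):
        print("server is running")
    else:
        print("timed out waiting for server")
```

Unlike the shell loop, this version gives up after a timeout instead of waiting forever when the controller fails to start.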

[Usage]: How do you setup vllm to work in k8s/openshift cluster

vllm 0.4.1 + qwen-14b-chat. The YAML is as follows:

```
apiVersion: apps/v1
kind: Deployment
metadata:
  name: vllm
  labels:
    app: vllm
spec:
  replicas: 1
  selector:
    matchLabels:
      app: vllm
  template:
    metadata:
      labels:
        app:...
```