gujing

Results 6 comments of gujing

kind of bug. please @richard2006 fix it, thanks.

change code `E_seq = np.delete(E_seq, i_to_remove, axis=0)` to `E_seq = np.delete(E_seq, i_to_remove.astype(np.int), axis=0)` in telemanom/errors.py

> > Can you try downgrading `transformers` to 4.21.0? > > I tried, another bug happend: Traceback (most recent call last): File "/home/ubuntu/./ColossalAI/applications/Chat/inference/server.py", line 10, in from llama_gptq import load_quant...

for linux ```bash # server nohup python3 -m fastchat.serve.controller >> /root/server.log 2>&1 & while [ `grep -c "Uvicorn running on" /root/server.log` -eq '0' ];do sleep 1s; echo "wait server running"...

vllm 0.4.1 + qwen-14b-chat the yaml as below: ``` apiVersion: apps/v1 kind: Deployment metadata: name: vllm labels: app: vllm spec: replicas: 1 selector: matchLabels: app: vllm template: metadata: labels: app:...