Vector Zhou
Results
1
comments of
Vector Zhou
Same issue when hosting `DeepSeek-R1-Distill-Qwen-14B` with commands `python3 -m sglang.launch_server --model-path ${MODEL_PATH} --tp 8 --dist-init-addr ${IP}:5000 --trust-remote-code --host 0.0.0.0 --port 30000 --enable-dp-attention --dp-size 8 --enable-torch-compile --torch-compile-max-bs 8` on 8 H100s.