jtmer

Results 4 comments of jtmer

I got the same error. I only have 1 A800 gpu card. When I just run the benchmark command: ``` python -m sglang.bench_latency --model-path /path_to_my_local_model/llama3_8B_chat_self_play_wenxin --batch 32 --input-len 256 --output-len...

In addition, `pip install -r requirements/requirements.txt` will download pytorch for cuda12, so I uninstalled it and installed pytorch for cuda11.8 after running this command

> Hi, What is your nvcc version? 我把镜像改了,问题解决了。但是解决以后遇到了新问题,,我放弃了