Pengyu Wang
Results
2
issues of
Pengyu Wang
Hi, I tried to use CuAssembler on the sample CuTest code. It works well on the build-in cubin files. However, when I recompiled it using NVCC11.6 and the same arguments,...
Greate work. But When I run examples/quick_start/srt_example_complete.py with RuntimeEndpoint("http://localhost:30000") with Server and V100 32GB ``` python -m sglang.launch_server --model-path ~/model/Llama-2-7b-chat-hf/ --port 30000 ``` Got result: ``` system : You are...
bug