Pengyu Wang

Results 2 issues of Pengyu Wang

Hi, I tried to use CuAssembler on the sample CuTest code. It works well on the build-in cubin files. However, when I recompiled it using NVCC11.6 and the same arguments,...

Greate work. But When I run examples/quick_start/srt_example_complete.py with RuntimeEndpoint("http://localhost:30000") with Server and V100 32GB ``` python -m sglang.launch_server --model-path ~/model/Llama-2-7b-chat-hf/ --port 30000 ``` Got result: ``` system : You are...

bug