Wuhan Zhang
Same here. I don't think 7b-base got slower; rather, Qwen1.5-7B-chat got faster.
I'm using FastChat.
I found that when I launch a Weights & Biases (wandb) service with simulated data alone, there are no issues with the service communication. However, when I simultaneously load a...
> I modified sglang's code, and it worked for me. Add this in sglang>srt>server.py line 143
>
> ```python
> try:
>     mp.set_start_method('spawn', force=True)
>     print("spawned")
> except RuntimeError:
>     ...
> ```
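For context, the quoted fix forces Python's multiprocessing `spawn` start method, which gives child processes a fresh interpreter instead of a forked copy of the parent (fork can break once CUDA is initialized in the parent). A minimal standalone sketch of the same idea, outside sglang:

```python
import multiprocessing as mp


def main():
    # Force the 'spawn' start method so worker processes start from a
    # fresh interpreter rather than inheriting parent state via fork.
    try:
        mp.set_start_method('spawn', force=True)
    except RuntimeError:
        # set_start_method may only be called once per program;
        # ignore the error if it was already configured.
        pass
    print(mp.get_start_method())  # → spawn


if __name__ == '__main__':
    main()
```

With `force=True` the call overrides any previously configured start method, which is why the quoted patch wraps it in a `try`/`except RuntimeError` purely as a safety net.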
I have encountered the same problem.
> > I have encountered the same problem.
>
> You can use vLLM directly for inference; I find it compatible with Mistral-Large-2.

Do you use the API?
> @liuanping @shangh1 @fuegoio All my package versions are listed above; as for vLLM, that is vllm==0.5.2. The inference code is quite simple. I'm using 4*H100 for Mistral-Large-2.
>
> > ...
Thank you for your reply. I've noticed what seems to be a blocking issue in this project: when a non-streaming request is made, other streaming requests get disconnected.
I have met the same issue. From my point of view, it happens in two scenarios: one is under heavy request pressure (like GraphRAG); the other is...
@RayneSun Which tool parser is used for deploying Qwen with vLLM?