Tom Dörr
Hi, thank you for your reply. I tried this but still have the same issue. I installed the newest sglang version directly from the repository ``` -e git+https://github.com/sgl-project/sglang.git@95c4e0dfac5a5f4a2f7f9292402fec26d0838f31#egg=sglang&subdirectory=python ``` prefill_token_logprobs:...
@m0g1cian Yes, I did that. The model generates the boolean value even when not constrained
I tried to extract the right logprob values from `prefill_token_logprobs`. It seems to me that those are the logprobs of the generated tokens, since the number of available token...
The issue seems to be that when I add an image to the prompt, there are too many logprobs in `prompt_logprobs`. Since only the last logprob differs, the resulting normalized...
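For reference, a minimal sketch of how a length-normalized score like this could be computed. The function name and the flat-list input shape are assumptions for illustration, not sglang's actual API; the point is just that one extra entry in the logprob list (e.g. from image tokens) shifts the normalized value:

```python
# Hypothetical sketch (not sglang's actual API): length-normalized logprob
# over a flat list of per-token logprobs, showing why an extra entry
# from image tokens changes the normalized score.

def normalized_logprob(token_logprobs):
    """Average logprob per token; assumes a flat list of floats."""
    if not token_logprobs:
        raise ValueError("empty logprob list")
    return sum(token_logprobs) / len(token_logprobs)

# Same text logprobs, but one run carries an extra entry -- the average differs.
text_only = [-0.5, -1.0, -0.25]
with_extra = [-0.5, -1.0, -0.25, -3.0]
print(normalized_logprob(text_only))   # ~ -0.5833
print(normalized_logprob(with_extra))  # -1.1875
```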
Interesting. By the way, the current version in the repo (main) changes what logprob information you get. It also fails in the same way for me but...
Solved it by setting `mem-fraction-static` to `0.9`. Full command that works for me: ``` python3 -m sglang.launch_server --model-path liuhaotian/llava-v1.6-34b --tokenizer-path liuhaotian/llava-v1.6-34b-tokenizer --port 30813 --tp 2 --mem-fraction-static '0.9' ``` Maybe a...
Did you get an error message? 'service isn't ready!' usually just means that it's not yet done loading the model weights, which can take a long time. For me loading...
"prixy=\n" or "proxy=\n"?
@arnavsinghvi11 Yes, I know what you mean. I planned to add support for DSPy components after support for the clients is merged. Since it's a bigger refactor I would have...
#1002 should be solved by https://github.com/stanfordnlp/dspy/pull/1043, but yes, they do look related