Wei-Lin Chiang

Results 111 comments of Wei-Lin Chiang

This is expected behavior, as @surak explained. We've been serving models for a long time on [chat.lmsys.org](chat.lmsys.org) and did not find this issue. However, if you find evidence of a memory leak, let...

@nd7141 @WGB0304 You may add `--share` to get a public URL, which may bypass this issue, although the link only lives for 72 hours.

@Ejafa Feel free to reopen if you have any other questions.

Thanks for reporting this issue. We looked into it and found that it comes from the gradient norm being too small at the beginning of training, which leads to an infinite loop...
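A minimal sketch of the failure mode described above (illustrative only, not FastChat's actual training code — the function name and the clamping fix are assumptions): dividing by a near-zero gradient norm produces huge or non-finite updates, and a naive "retry until the update looks sane" loop then never makes progress. Clamping the denominator avoids both problems.

```python
def rescale(grads, eps=1e-8):
    """Normalize a gradient vector by its norm.

    A near-zero norm would blow up the scale factor; clamping the
    denominator with max(norm, eps) keeps the result finite, so a
    sanity-check loop around the update can always terminate.
    """
    norm = sum(g * g for g in grads) ** 0.5
    scale = 1.0 / max(norm, eps)  # clamp: never divide by ~0
    return [g * scale for g in grads]

print(rescale([3.0, 4.0]))    # norm 5 -> unit vector [0.6, 0.8]
print(rescale([1e-20, 0.0]))  # tiny norm: stays finite thanks to eps
```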

Could you run the following code in your Python environment?

```python
import sys
print(sys.platform)
```

I think that will identify the problem.

Hi, I have tested Octave 4.0.0 on Debian 9. It works well with 8-thread parallelism. Did you add the option `-lgomp` in make.m?

You're welcome. Actually, you can check the FAQ here; there is a step-by-step guide for parallelizing libsvm with MATLAB/Octave: http://www.csie.ntu.edu.tw/~cjlin/libsvm/faq.html#f8032

Hi @huseyinatahaninan Sorry for the confusion. Let me try to clarify this. In our [paper](https://arxiv.org/abs/2306.05685) we study the reference-based judge, in which the LLM judge first generates a reference answer independently and...

Strong +1. Reka has been offering great language + vision models. Would love to see litellm support them.

This is unexpected. Can you try adding `--debug`:

```shell
python3 -m fastchat.serve.cli --model-path qwen/qwen-72b-chat --debug
```

and check what exactly the loaded chat template is?
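For reference, Qwen-Chat models expect the ChatML prompt format, so the loaded template should render something like the output below. This is an illustrative sketch with a hypothetical helper function, not FastChat's actual template code:

```python
def chatml_prompt(system, user):
    """Render a ChatML-style prompt (the format Qwen-Chat expects).

    Illustrative only: FastChat builds this through its conversation
    templates; this just shows what a correct render looks like.
    """
    return (
        f"<|im_start|>system\n{system}<|im_end|>\n"
        f"<|im_start|>user\n{user}<|im_end|>\n"
        f"<|im_start|>assistant\n"
    )

print(chatml_prompt("You are a helpful assistant.", "Hello!"))
```

If the `--debug` output shows a template without the `<|im_start|>`/`<|im_end|>` markers, the wrong template was matched for the model path.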