Junyang Lin
This is really a terrific idea! I think we should collect more data for fine-tuning to make things work better.
Looks fine to me; the 1.8B hyperparameters are slightly different, so check generation_config. I tested the q4_k_m GGUF and it also outputs the agent content normally.

```
FROM
TEMPLATE """{{ if .System }}<|im_start|>system
{{ .System }}<|im_end|>
{{ end }}{{ if .Prompt }}<|im_start|>user
{{ .Prompt }}<|im_end|>
{{ end }}<|im_start|>assistant
"""
SYSTEM """You are a helpful...
```
Will GPTQ be supported?
I should say it might be quite difficult to incorporate our NTK method with continuous batching. Sorry for the inconvenience...
```shell
python -m vllm.entrypoints.openai.api_server --host 0.0.0.0 --port 8000 --model Qwen1.5-0.5B-Chat --dtype=half
```
Running this command can replicate the issue, right?
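For anyone trying to reproduce: the command above starts an OpenAI-compatible server, so a request like the following should hit it. This is a minimal sketch assuming the host, port, and model name from that command; the prompt and `max_tokens` value are placeholders, and the actual send is left commented out since it needs the server running.

```python
import json

# Endpoint implied by the launch command above (--host 0.0.0.0 --port 8000).
BASE_URL = "http://0.0.0.0:8000/v1/chat/completions"

# Chat-completions payload; model name matches the --model flag above.
payload = {
    "model": "Qwen1.5-0.5B-Chat",
    "messages": [{"role": "user", "content": "Hello"}],  # placeholder prompt
    "max_tokens": 32,  # arbitrary small cap for a quick repro
}

body = json.dumps(payload)
print(body)

# To actually send it once the server is up:
# import urllib.request
# req = urllib.request.Request(
#     BASE_URL, data=body.encode(),
#     headers={"Content-Type": "application/json"},
# )
# print(urllib.request.urlopen(req).read().decode())
```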
This is a known issue with the models. There is no good temporary fix until we update the checkpoints.
No ETA; still working on it.
Yeah, it even outcompetes 72B by a lot. But since these benchmarks sometimes fail to really reflect quality, and since we have confidence in Chinese, we did...
We'll take a look, but strange things may happen with quantized models from time to time. Not sure if we can solve this, but we advise you to use a...
What does "custom embedding data" refer to? Please describe your problem in more detail.