Modas Li

Results 29 comments of Modas Li

> 1. vLLM + OpenAI API:如果是单独vLLM,是没有完整支持Qwen对话模式的,建议用FastChat+vLLM。 > 2. chat调用参数格式不正确,请参考OpenAI API说明修正messages的格式。 > 如果是单独的vLLM,还需要传入正确的stop_token_ids ([151643, 151644, 151645]) 谢谢解答,请问stop_token_ids 如何传入,在哪里传入呢?谢谢~

qwen-7b after sft fine-tuning 发自我的iPhone ------------------ Original ------------------ From: Ren Xuancheng ***@***.***> Date: Tue,Dec 12,2023 5:52 PM To: QwenLM/Qwen ***@***.***> Cc: Modas Li ***@***.***>, Author ***@***.***> Subject: Re: [QwenLM/Qwen] AssertionError:...

没办法进行多卡分配模型

same warning: The attention mask and the pad token id were not set. As a consequence, you may observe unexpected behavior. Please pass your input's attention_mask to obtain reliable results....

Solution: update model files to the latest and alter config(sft.json, ds_config.json) fp16 to bf16,make sure your GPUs are more than A100(40g)*4.

设置地址后,浏览器助手变空白 ![image](https://github.com/QwenLM/Qwen-Agent/assets/40042370/bb91807d-6e6f-4bde-9510-741a1b79cc11)

端口需要怎么调整呢?哪些是linux机器必须开通的呢

生成的图片如何修改IP地址呢? ![image](https://github.com/QwenLM/Qwen-Agent/assets/40042370/4fc9b8d5-5e13-4604-aaf0-86d7afc8bb3f)