job

Results 4 issues of job

### Is there an existing issue for this? - [X] I have searched the existing issues ### Environment ```markdown - Milvus version: - Deployment mode(standalone or cluster): - MQ type(rocksmq,...

kind/bug
triage/needs-information

我用的是Qwen2-7B 不是Qwen2-7B-Instruct 因为Qwen2-7B 是128k ,Qwen2-7B-Instruct 32k 用vllm推理Qwen2-7B默认的参数会一直重复直到max_new_tokens的长度,我修改tokenizer_config.json中eos_token为后有时候还是会出现一直重复直到max_new_tokens的长度。 {'messages': [{'role': 'user', 'content': '1+1'}], 'model': 'Qwen2-7B', "stream":true,} ![Uploading 捕获2.JPG…]()