Jimmy_L comments

Repositories
Issues
Comments

Results 24 comments of


                                            Jimmy_L

多卡微调Qwen2.5-14B显存分配不均

> 用deepseek-110k做finetune的效果咋样我只做了finetune，没有强化学习。调完基本也能模仿think的方式进行思考吧，批量测试时候发现偶尔会遇到复读机状态。

LLM JSON Output Incorrectly Extracts Data from "<think>" Label

Same problem, I use 1.8.1 version now, some times reasoning model will repeat the json schema in thinking part then output the final json in answer, but dify extract the...

使用xinfernece启动DeepSeek-R1-Distill-Qwen-14B时候，通过程序调用接口，发现缺少<think>起始符号，有</think>

同样的问题，我直接用vllm0.7.2 ‘vllm sreve’ 跑deepseek-Qwen-Distill-32B无量化版是有完整标签的，但是用xinference里面用vllm0.7.2，就会缺标签。

Uploading pictures in the workflow seems to get an error

所以应该怎么解决？我用的是zhipuai。