Cheng Rui

[email protected]

Hi, I am a CV engineer, video framework development, AIGC，I hope my algorithm and engineering skills will continue to improve

Results 14 issues of


                                            Cheng Rui

heLlo ,非常棒的工作有些问题想请教下

5

comment

1.COT的强化是否指在通过在模型输入和输出格式在SFT中训练体现？ 2. 类似之前autoweb WEBL的强化学习后训练后续会有这块工作分享吗？ 3. 用PRM-PPO是否后续GUI AGENT或者VLM LLM的唯一途径？ 4. 会开源数据集和PT的训练细节吗？

Hello，想使用其他模型微调这种场景，方便提供一些数据集和参考例子吗？

hello,some questions about project

4

comment

感谢提供的项目idea 1.if only text input , which is equivalent to Mindsearch? 2.如果VLM的能力经过微调或者是更大的vlm是否可能替代掉ground dino? 有没有考虑提供分离大模型服务的后端API？ 3.搜索模型使用的是Internlm2原因是否只是因为这个模型经过相关数据训练，这个几个步骤有没有可能可以合并为一个VLM进行，目前因为模型能力受限。所以做的过渡组合？

Why is the TPS of eagle3-qwen in the sglang inference of single-card H20 not as high as that of the original QWEN3 when the decoding algorithm is added

1

comment

Hello, I'm testing the speed of 100 tokens on a single H20. The original qwen3 has 200TPS during sglang inference, while the draft model eagle3 only has 130TPS. What's the...

‹
1
2