Junyang Lin comments

Results 173 comments of


                                            Junyang Lin

Any plan for Qwen2 14B and Qwen2-32B?

这里做下解释。在Qwen2本次设计中，MoE是medium size的模型主力，仅仅激活14B，但是达到30B模型的效果。但当前生态对MoE的支持还不完善，57B的模型大小对于显存要求较高，我们正在计划补上14和32这两个size的模型，但这两个模型毕竟是比较大的模型，还需要一些时间。 We previously hope that the MoE model can be your choice for a medium-size model. It actually only activates only 14B params in each forward pass but it...

Junyang Lin

Any plan for Qwen2 14B and Qwen2-32B?

Build on top of MetaGPT

Add attach file button to chat input

Create Self-Discover Prompting Agent

Continue pretrainig support

Qwen1.5 72B Chat AWQ 需要多少显存可以运行

Swe-bench evaluation not working

ExceptionPxssh: Could not establish connection to host, docker login

用vllm部署，用openai api访问，合理的温度范围是多少啊

官方代码部署两个人调用会报错RuntimeError: probability tensor contains either inf, nan or element < 0 #102