Yue Cao
Yue Cao
Hello, thanks for your work. But I would like to ask if I want the convnext model to use local files instead of downloading them every time. How should this...
Hello, your job is so great. But I would like to ask, is it convenient to disclose the training parameters of your single branch convnext encoder? I am not very...
Hello, Thank you for your excellent work. Could you provide the loss curve of sft? Because when I tried it myself, I found that the loss curve would fluctuate greatly,...
Thank you for your excellent work. I have some questions about data processing and look forward to your response. 1. Do you use the same data processing scheme as SMDM,...
Thank you for your outstanding work. Following your recommendation, I attempted to implement LLaDA training from the SMDM code repository, but I am unsure how to set these parameters for...