DeepSpeedExamples
DeepSpeedExamples copied to clipboard
单机多卡进行RLHF在第三步中使用Qwen模型作Actor Model报错
Hi, I'm also using deepspeedchat for RLHF training qwen, did you solve this problem?
^