Shuo Yin
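I'm a beginner, but based on the tutorial and the documentation, I believe that in the "chained proxy forwarding" example in this tutorial's Advanced -> Proxy Forwarding chapter, the order of nodes the packets pass through should be PC -> DOUS -> DOSG -> AliSG -> AliHK -> target website, rather than PC -> AliHK -> AliSG -> DOSG -> DOUS -> target website. Because if...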
Hello, your work is very good, and I learned a lot from your paper and your code. But when I try to reproduce your experiment using `python main.py...
### Required prerequisites - [X] I have read the documentation. - [X] I have searched the [Issue Tracker](https://github.com/PKU-Alignment/safe-rlhf/issues) and [Discussions](https://github.com/PKU-Alignment/safe-rlhf/discussions) that this hasn't already been reported. (+1 or comment...
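Hi, thanks for open-sourcing this. I have a question. When I train InternVL3.5-30B-A3B with ZeRO-3 on 4 machines with 8 H20 GPUs, the job hangs; a screenshot of the log is below. GPU utilization stays at 100% with about 10 GB of memory in use, so training clearly has not started. In the same environment, training a 4B dense model works fine. Based on my experience training the dense model, after the log above is printed it should proceed to...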
Thanks for your contribution! Great work! When I used ppo ray (grpo), I found that RAM usage looked like this during training: Observations: 1. At each saving point, the cache...