Reinforcement-learning-with-tensorflow icon indicating copy to clipboard operation
Reinforcement-learning-with-tensorflow copied to clipboard

PPO中如何处理不同长度的episode?

Open YingxiaoKong opened this issue 4 years ago • 0 comments

莫凡你好, 我想问一下,如果我的每个episode的长度不一样该怎么办呢? 比如,我的每个episode都是以agent到达目的地结束,但是到达目的地的步数是不一样的。这种情况下该如何把他们变成一样的呢?

YingxiaoKong avatar Mar 19 '20 02:03 YingxiaoKong