GIS-PuppetMaster comments

Results: 9 comments of GIS-PuppetMaster

Bug of PPO

It should be like this:

`self.cliped_ratio = tf.clip_by_value(self.ratio, 1. - METHOD['epsilon'], 1. + METHOD['epsilon'])`

`self.min_temp = tf.minimum(self.ratio, self.cliped_ratio)`

`self.aloss = -tf.reduce_mean(self.min_temp * self.tfadv)`
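For comparison, here is a minimal NumPy sketch (not the repository's code) that puts the expression suggested above next to the clipped surrogate as it is usually written, where the minimum is taken after multiplying by the advantage. The function names, the sample values, and `epsilon=0.2` are assumptions chosen for illustration. The two forms agree when the advantage is positive and differ when it is negative.

```python
import numpy as np

def standard_clipped_loss(ratio, adv, epsilon=0.2):
    # Usual form: minimum of (ratio * adv) and (clipped ratio * adv).
    clipped = np.clip(ratio, 1.0 - epsilon, 1.0 + epsilon)
    return -np.mean(np.minimum(ratio * adv, clipped * adv))

def suggested_loss(ratio, adv, epsilon=0.2):
    # Form from the comment: minimum of the ratios first, then multiply by adv.
    clipped = np.clip(ratio, 1.0 - epsilon, 1.0 + epsilon)
    return -np.mean(np.minimum(ratio, clipped) * adv)

# Hypothetical sample values, for illustration only.
ratio = np.array([0.5, 1.0, 1.5])
pos_adv = np.array([1.0, 1.0, 1.0])
neg_adv = np.array([-1.0, -1.0, -1.0])

print(standard_clipped_loss(ratio, pos_adv), suggested_loss(ratio, pos_adv))
print(standard_clipped_loss(ratio, neg_adv), suggested_loss(ratio, neg_adv))
```

With a negative advantage the two losses no longer match, so which form is intended matters for the actor update.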

Bug of PPO

> Why does the negative value cause failure in the actor loss?
> You can also refer to OpenAI baselines [here](https://github.com/openai/baselines/tree/master/baselines/ppo1), which has a similar process to our repo. I drew the loss...

Don't understand why the type of ba is converted to np.int64 inside push

`buffer_r.append((r+8.1)/8.1) # normalize` Why does r need to be normalized? Where does 8.1 come from?

Does RLzoo support Dict gym env state?

> Hi,
> It supports dict state, but you need a wrapper for your env.
> Please take a look at the FlattenDictWrapper (./common/env_wrappers.py) for robotics env.

Thanks, I think...
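As a rough idea of what flattening a dict state involves, the following NumPy-only sketch concatenates selected entries of a dict observation into one vector. It is not RLzoo's FlattenDictWrapper: the helper name `flatten_dict_obs` and the keys `observation` and `desired_goal` are assumptions made for illustration.

```python
import numpy as np

def flatten_dict_obs(obs, keys):
    # Concatenate the chosen dict entries, in the given order, into a 1-D vector.
    return np.concatenate(
        [np.asarray(obs[k], dtype=np.float32).ravel() for k in keys]
    )

# Hypothetical dict observation, for illustration only.
obs = {"observation": np.zeros(3), "desired_goal": np.ones(2)}
flat = flatten_dict_obs(obs, ["observation", "desired_goal"])
print(flat.shape)
```

Fixing the key order explicitly keeps the layout of the flattened vector stable between steps.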

Severe lag

It lags when maximized. Windowed mode is much better, and the smaller the window, the smoother it seems to run.

add PPO-HER

Is it possible? I think PPO v2 is not strictly an 'off-policy' algorithm.

Show more custom information of tensor and gradient

This is only a demo version. It seems Graphviz always displays the attributes center-aligned, since this tool uses the caption to contain them. Better to display them with left...

JPEG XL HDR images on a non-HDR-system appears dark and dull

Same problem here: an AVIF photo exported by Lightroom with HDR information looks gray.

Please add mode to service road intersections

The two roads are treated as running in the same direction, so their lights turn green at the same time.