TensorFlow_RLRE get_action and deicde

get_action and deicde_action

Open jiadd opened this issue 7 years ago • 4 comments

您好，请教一下rlmodel.py中的 get_action 和 deicde_action 有什么区别与联系呢，用这两个函数的作用分别是什么呢？谢谢！

Nov 05 '18 08:11 jiadd

get_action是根据概率采样。deicde_action是贪心的选择概率较大的。

Nov 06 '18 03:11 xuyanfu

明白了非常感谢！还有一个问题想请教，您的代码rlmodel.py中entity_embedding = tf.get_variable(name = 'entity_embedding',initializer=entity_ebd,trainable=False)，如果我想让entity_embedding参与训练即trainable=True，该怎么修改代码呢？现在计算梯度时报如下错误： gradBuffer[index] += grad（rlmodel.py 282行） ValueError: could not broadcast input array from shape (2,50) into shape (2) 谢谢！

Nov 13 '18 06:11 jiadd

不好意思，最近已经很久没有看过这个项目的代码了，抱歉帮不上什么忙。

Nov 13 '18 12:11 xuyanfu

您好，请问下您为什么要有两种action呢，麻烦解答了~

Oct 02 '19 05:10 zxs1995

TensorFlow_RLRE TensorFlow_RLRE copied to clipboard

get_action and deicde_action

TensorFlow_RLRE
TensorFlow_RLRE copied to clipboard