Reinforcement-learning-with-tensorflow issues

关于10_A3C文件夹里面后三个代码文件出现如下问题：tuple indices must be integers or slices, not tuple的解决办法

原因：原本代码中有这样一行代码s = self.env.reset()，返回的s是个元组类型([ 表示状态的三个数 ], { }) 。而choose_action函数里面的s[np.newaxis, :]，这是对数组s的行增加一个维度操作。而按照原本的代码，传进来的是元组，故不可行。解决：把元组的第一个数组元素抽取出来就好：s = self.env.reset() 改为 s = self.env.reset()[0]。此外，self.env.step(a) 这里也会出错，因为它返回的有5个值，所以得用5个变量来接收，在info后随便加个变量名就好。（反正info和要加的变量名都没有用到，所以它俩的顺序不重要。）另外，如果启用self.env.render()，要在gym.make里加个渲染模式，这不是重点，重点是代码中有两处gym.make，都要加上，否则没用。注意：我在开启渲染后会卡住，无法显示渲染窗口。

Jing-Loog

INPUT and OUTPUT-solve classifier-question

I really want to know how to make the format of dataset.I have 30-demension variables as input and 0-1class as output .how can I put it into the SAC model?

luzi560

每次运行实例都会出现中断，产生keyerror：

1

错误如下 Traceback (most recent call last): File "C:\Program Files\Python39\lib\site-packages\pandas\core\indexes\base.py", line 3080, in get_loc return self._engine.get_loc(casted_key) File "pandas\_libs\index.pyx", line 70, in pandas._libs.index.IndexEngine.get_loc File "pandas\_libs\index.pyx", line 98, in pandas._libs.index.IndexEngine.get_loc File "pandas\_libs\index_class_helper.pxi", line...

ZQYyyo

关于DDPG算法

1

def choose_action(self, s): s = s[np.newaxis, :] # single state return self.sess.run(self.a, feed_dict={S: s})[0] # single action 你好，想问问这个函数返回值最后[0]是返回的什么东西呢

zhenbin-li

迷宫环境的疑问

请问作者在设计DQN的迷宫环境时，每次reset可以产生随机生成环境吗？智能体与障碍物发生碰撞的时候，可以选择让其保持在原有位置吗？请问可以将self.observation_space = spaces.Box(low=0, high=3, shape=(529,), dtype=np.int32) 迷宫环境转为一维的吗？例如这样self.observation_space = spaces.Box(low=0, high=3, shape=(529,), dtype=np.int32) 0为可移动 1为智能体位置 2 为终点 3 为障碍物

guest-oo

add conda env support

更新编程环境文件，可以通过conda根据新文件创建环境。减少了学习者配环境的压力

wsqstar

迷宫问题结果有随机性吗

想请问大家，有没有觉得迷宫的运行结果具有随机性，有时候能很快学到、有时候不能很快学到甚至在规定episode内学得很差，这是什么原因，应该怎么解决呢？

ccconquer

ImportError: cannot import name 'DeepQNetwork'

请问运行run_this.py的时候会报 --------------------------------------------------------------------------- ImportError Traceback (most recent call last) /data/jupyter_workspace/mofan/contents/5_Deep_Q_Network/run_this.py in 1 from maze_env import Maze ----> 2 from RL_brain import DeepQNetwork 3 4 5 def run_maze(): ImportError: cannot import name...

MikasaIL

打算做个视频讲GRPO吗？

莫凡打算做个视频讲GRPO吗？

shijiale0609

Reinforcement-learning-with-tensorflow
Reinforcement-learning-with-tensorflow copied to clipboard

Metadata

关于10_A3C文件夹里面后三个代码文件出现如下问题：tuple indices must be integers or slices, not tuple的解决办法

INPUT and OUTPUT-solve classifier-question

每次运行实例都会出现中断，产生keyerror：

关于DDPG算法

迷宫环境的疑问

add conda env support

迷宫问题结果有随机性吗

ImportError: cannot import name 'DeepQNetwork'

打算做个视频讲GRPO吗？

← Metadata

Owner

Metadata

Reinforcement-learning-with-tensorflow Reinforcement-learning-with-tensorflow copied to clipboard

Metadata

← Metadata

Owner

Metadata

Reinforcement-learning-with-tensorflow
Reinforcement-learning-with-tensorflow copied to clipboard