guest-oo
guest-oo
tensor_action = tensor_action.argmax(dim=1) IndexError: Dimension out of range (expected to be in range of [-1, 0], but got 1)
\demo_A2C_PPO.py env_args = {'env_name': 'CartPole-v1', 'num_envs': 1, 'max_step': 500, 'state_dim': 4, 'action_dim': 2, 'if_discrete': True} | Arguments Remove cwd: ./CartPole-v1_DiscreteA2C_0 | Evaluator: | `step`: Number of samples, or total training...
class mazeenv(gym.Env): def __init__(self): self.action_space = spaces.Discrete(8) self.observation_space = spaces.Box(low=0, high=23, shape=(14,), dtype=np.float32) self.state = None self.seed() self.reset() def seed(self, seed=None): self.np_random, seed = seeding.np_random(seed) return [seed] def step(self, action):...
DQN
请问 我使用helloworld——DQN文件训练完成后 如何保存训练完成的神经网络参数以及在测试中对其进行使用。以及我应该如何设置自己想要的环境而不是gym现成的环境。
Author, I'm getting an error when I'm running train. usage: train [-h] [--no-wandb] trainer train: error: the following arguments are required: trainer
请问rrt算法中 当随机点与最近节点小于步长的时候,我看有的人写的时按照此时的方向继续向前扩展步长的长度,而您的代码是直接将随机点与最近节点相连,请问两者有区别吗?
May I ask the author that if I increase the speed of the moving obstacle, the collision between the robot and the obstacle will occur in the result of MPC-CBF?...
May I ask the author, where is the matrix used to calculate the terminal error, stage cost, and input cost? May I ask why mterm = self.model.aux['cost'] # Terminal cost...
File "C:\Users\user\Desktop\safe-explorer-master\safe-explorer-master\safe_explorer\multiprocessing.py", line 11, in def paramap(f, X,nprocs = multiprocessing.cpu_count() ): AttributeError: partially initialized module 'multiprocessing' has no attribute 'cpu_count' (most likely due to a circular import) and from multiprocessing...
n_states = 2 _x = model.set_variable(var_type='_x', var_name='x', shape=(n_states, 1)) mpc.bounds['lower', '_x', 'x'] = [2,2] mpc.bounds['upper', '_x', 'x'] = [22,22] Please ask the author that I have set the range of...