Reinforcement-Learning-Maze
Reinforcement-Learning-Maze copied to clipboard
How is the reward value in the maze environment set?
Hello, how do you set the reward value for each action in the environment, and what is the basis for judging?Thank you for your answer.