vahidqo

Results 7 issues of vahidqo

When the environment has an action space in which each action dimension has a different size, like this: `self.action_space = MultiDiscrete([3,2])`, and the action masker looks like this, for example: `a = [[True,...

documentation
help wanted
question
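For the `MultiDiscrete([3,2])` masking question above, one possible approach is to avoid a ragged nested list and instead concatenate the per-dimension masks into a single flat mask of length 3 + 2 = 5. As far as I know, this is the layout sb3-contrib's MaskablePPO expects for `MultiDiscrete` spaces, but that is an assumption here, since the issue text is truncated and does not name the library. The helper name `flatten_multidiscrete_mask` and the example mask values are hypothetical. A minimal sketch in plain Python:

```python
from typing import List, Sequence


def flatten_multidiscrete_mask(
    per_dim_masks: Sequence[Sequence[bool]], nvec: Sequence[int]
) -> List[bool]:
    """Concatenate one boolean mask per action dimension into a flat mask.

    Hypothetical helper: assumes the masker wants a flat mask of length
    sum(nvec), with dimension 0 first, then dimension 1, and so on.
    """
    if len(per_dim_masks) != len(nvec):
        raise ValueError("need exactly one mask per action dimension")
    flat: List[bool] = []
    for mask, n in zip(per_dim_masks, nvec):
        if len(mask) != n:
            raise ValueError(f"mask of length {len(mask)} does not match size {n}")
        if not any(mask):
            # A dimension with no valid choice leaves nothing to sample from.
            raise ValueError("each dimension needs at least one valid action")
        flat.extend(bool(m) for m in mask)
    return flat


nvec = [3, 2]  # same sizes as MultiDiscrete([3, 2])
ragged = [[True, False, True], [False, True]]  # example values, not from the issue
flat_mask = flatten_multidiscrete_mask(ragged, nvec)
print(flat_mask)  # [True, False, True, False, True]
```

In an environment, a method that returns this flat mask could then be handed to whatever masking hook the library provides.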

Is there any English documentation, please? I really need it.

Dear all, could you please explain how to use authorization? For example, how do I set user permissions and use them like this: `{permissions === 'admin' && }`? Best regards,

Hi there, I want to use fileinput, but I get this error: `The submitted data was not a file. Check the encoding type on the form` when I use...

### 🐛 Bug During training, the algorithm produces NaN values. These NaN values originate in the neural network. I found several ideas in issues that were proposed,...

bug
more information needed
custom gym env
No tech support

### ❓ Question Hi, I'm using MaskablePPO on a powerful computer, but training is no faster than on a normal computer. Is there any option or line...

question

### ❓ Question Hi, during training in a custom environment with MaskablePPO, the reward decreased and then converged. Is there any specific reason? It means the algorithm has found a...

question
more information needed
custom gym env