mushroom-rl icon indicating copy to clipboard operation
mushroom-rl copied to clipboard

Python library for Reinforcement Learning.

Results 15 mushroom-rl issues
Sort by recently updated
recently updated
newest added

For an even number of atoms, the calculation of ``self._a_values`` (see [here](https://github.com/MushroomRL/mushroom-rl/blob/649eefeefdcec49bdf4beacd3b06e85ebd8bce90/mushroom_rl/algorithms/value/dqn/categorical_dqn.py#L36)) does not seem to be 100% correct. This behavior is reproducible via ```python import torch v_min = -5...

bug

Two common performance metrics used in the RL literature are mean return and median return because the median is less influenced by outliers. However, `compute_metrics()` does not compute the median....

enhancement

If I can make a suggestion, perhaps you could rename the function `mushroom_rl.utils.dataset.episodes_length()` to `mushroom_rl.utils.dataset.compute_episodes_length()`. This would better match your function naming convention (e.g. `compute_J()`, `compute_metrics()`) and would also allow...

enhancement

**Is your feature request related to a problem? Please describe.** I was wondering why the [predict method in class TorchApproximator](https://github.com/MushroomRL/mushroom-rl/blob/af7dec46c28a83d133c2b985746f54c8b25aa95e/mushroom_rl/approximators/parametric/torch_approximator.py#L80) calculates the gradients and calls ``self.network.forward(*torch_args, **kwargs)``. **Describe the solution...

question

Can't install the package ```./python.exe -m pip install mushroom-rl``` **System information:** - OS: Windows 10 (21H2, 19044.2006) - Python version: 3.9 & 3.10 - Torch version: 1.12.1 - Mushroom version:...

bug

I've found out in SAC the optimizer for alpha is not loaded properly. The alpha parameter is not continued training after load the agent for the second time

bug

**Is your feature request related to a problem? Please describe.** I would like to reduce the memory taken by RL with atari, so I can run many experiments at the...

enhancement

Dyna-Q is a conceptual algorithm that illustrates how real and simulated experience can be combined in building a policy. Planning in RL terminology refers to using simulated experience generated by...

enhancement

wrapper_args is a list of lists, not list of dicts