mushroom-rl issues

Results 15 mushroom-rl issues

[Categorical DQN/Rainbow] Inconsistent behavior of Categorical DQN for an even number of atoms

For an even number of atoms, the calculation of ``self._a_values`` (see [here](https://github.com/MushroomRL/mushroom-rl/blob/649eefeefdcec49bdf4beacd3b06e85ebd8bce90/mushroom_rl/algorithms/value/dqn/categorical_dqn.py#L36)) does not seem to be 100% correct. This behavior is reproducible via ```python import torch v_min = -5...

Flo-Wo

bug

Suggestion: Add median to compute_metrics

Two common performance metrics used in the RL literature are mean return and median return because the median is less influenced by outliers. However, `compute_metrics()` does not compute the median....

RylanSchaeffer

enhancement
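A minimal sketch of the point raised in this issue, using only the standard library: it reports the median next to the mean for a list of per-episode returns. The helper name `summarize_returns` and its return format are hypothetical and do not reflect the actual signature of `compute_metrics()`.

```python
from statistics import mean, median


def summarize_returns(returns):
    """Return min, max, mean and median of a list of per-episode returns.

    Hypothetical helper for illustration only; not part of mushroom-rl.
    """
    if not returns:
        raise ValueError("returns must contain at least one episode")
    return {
        "min": min(returns),
        "max": max(returns),
        "mean": mean(returns),
        "median": median(returns),
    }


# A single outlier episode pulls the mean far more than the median.
stats = summarize_returns([1.0, 2.0, 3.0, 4.0, 100.0])
print(stats["mean"], stats["median"])
```

With the toy returns above, the outlier episode raises the mean well above the typical episode, while the median stays at the middle value.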

Suggestion: rename episodes_length to compute_episodes_length

If I can make a suggestion, perhaps you could rename the function `mushroom_rl.utils.dataset.episodes_length()` to `mushroom_rl.utils.dataset.compute_episodes_length()`. This would better match your function naming convention (e.g. `compute_J()`, `compute_metrics()`) and would also allow...

RylanSchaeffer

enhancement
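One way such a rename could be done without breaking existing callers is to keep the old name as a deprecated alias. The sketch below is an assumption, not the library's code: the function body is a stand-in, and it assumes each dataset sample is a tuple whose last element flags the final step of an episode.

```python
import warnings


def compute_episodes_length(dataset):
    """Count the steps in each completed episode.

    Assumption: the last element of every sample is a "last step" flag.
    A trailing episode that never sets the flag is not counted.
    """
    lengths = []
    steps = 0
    for sample in dataset:
        steps += 1
        if sample[-1]:
            lengths.append(steps)
            steps = 0
    return lengths


def episodes_length(dataset):
    """Deprecated alias kept so that existing code keeps working."""
    warnings.warn(
        "episodes_length() is deprecated; use compute_episodes_length()",
        DeprecationWarning,
        stacklevel=2,
    )
    return compute_episodes_length(dataset)


# Toy dataset: (reward, last) pairs forming episodes of 2 and 3 steps.
toy_dataset = [(0.0, False), (1.0, True), (0.0, False), (0.0, False), (1.0, True)]
print(compute_episodes_length(toy_dataset))
```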

Question: TorchApproximator.predict - Why no torch.no_grad() and why call forward directly?

**Is your feature request related to a problem? Please describe.** I was wondering why the [predict method in class TorchApproximator](https://github.com/MushroomRL/mushroom-rl/blob/af7dec46c28a83d133c2b985746f54c8b25aa95e/mushroom_rl/approximators/parametric/torch_approximator.py#L80) calculates the gradients and calls ``self.network.forward(*torch_args, **kwargs)``. **Describe the solution...

VanillaWhey

question

Can't install package

Can't install the package with `./python.exe -m pip install mushroom-rl`.

**System information:**
- OS: Windows 10 (21H2, 19044.2006)
- Python version: 3.9 & 3.10
- Torch version: 1.12.1
- Mushroom version:...

Guiorgy

bug

Documentation update

Maksud20

SAC postload optimizer for alpha

I've found that in SAC the optimizer for alpha is not loaded properly. The alpha parameter does not continue training after the agent is loaded a second time.

PuzeLiu

bug

compress frames

**Is your feature request related to a problem? Please describe.** I would like to reduce the memory taken by RL with atari, so I can run many experiments at the...

davidenitti

enhancement
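As a rough illustration of the request, frames can be stored losslessly compressed and decompressed only when sampled. This sketch uses the standard-library `zlib` module on raw bytes; the class name `CompressedFrameBuffer` and the 84x84 grayscale frame size are assumptions for the example, not anything taken from mushroom-rl.

```python
import zlib


class CompressedFrameBuffer:
    """Store frames as zlib-compressed bytes (hypothetical helper)."""

    def __init__(self, level=1):
        # Level 1 favours speed over compression ratio.
        self._level = level
        self._frames = []

    def append(self, frame):
        """Compress and store one frame given as a bytes object."""
        self._frames.append(zlib.compress(frame, self._level))

    def __getitem__(self, index):
        """Decompress and return the frame at the given index."""
        return zlib.decompress(self._frames[index])

    def __len__(self):
        return len(self._frames)

    def stored_bytes(self):
        """Total size of the compressed payloads, in bytes."""
        return sum(len(f) for f in self._frames)


# An all-black 84x84 grayscale frame, one byte per pixel.
frame = bytes(84 * 84)
buffer = CompressedFrameBuffer()
buffer.append(frame)
print(len(frame), buffer.stored_bytes())
```

The trade-off is extra CPU time on every insert and sample, so the gain depends on how compressible the frames are.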

dynaq agent

Dyna-Q is a conceptual algorithm that illustrates how real and simulated experience can be combined in building a policy. Planning in RL terminology refers to using simulated experience generated by...

sharabhshukla

enhancement
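The idea described in this issue can be sketched in a few lines of tabular code: each real transition updates the Q-table and is stored in a model, and the model is then replayed for a number of planning updates. The chain environment, hyperparameters, and function names below are assumptions chosen for illustration; this is not a mushroom-rl agent.

```python
import random

N_STATES = 5      # states 0..4; state 4 is the goal
ACTIONS = (0, 1)  # 0 = left, 1 = right


def step(state, action):
    """Deterministic chain: reward 1 on reaching the goal, else 0."""
    next_state = max(state - 1, 0) if action == 0 else state + 1
    done = next_state == N_STATES - 1
    return next_state, (1.0 if done else 0.0), done


def greedy(q, state, rng):
    """Pick a greedy action, breaking ties at random."""
    best = max(q[state])
    return rng.choice([a for a in ACTIONS if q[state][a] == best])


def q_update(q, s, a, r, s2, done, alpha, gamma):
    """One Q-learning backup; no bootstrapping from terminal states."""
    target = r if done else r + gamma * max(q[s2])
    q[s][a] += alpha * (target - q[s][a])


def dyna_q(episodes=50, planning_steps=10, alpha=0.5, gamma=0.9,
           epsilon=0.1, seed=0):
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    model = {}  # (state, action) -> (reward, next_state, done)
    for _episode in range(episodes):
        s = 0
        for _step in range(1000):  # safety cap on episode length
            explore = rng.random() < epsilon
            a = rng.choice(ACTIONS) if explore else greedy(q, s, rng)
            s2, r, done = step(s, a)
            # Learn from the real transition, then remember it.
            q_update(q, s, a, r, s2, done, alpha, gamma)
            model[(s, a)] = (r, s2, done)
            # Planning: replay transitions sampled from the model.
            for _plan in range(planning_steps):
                (ps, pa), (pr, ps2, pdone) = rng.choice(list(model.items()))
                q_update(q, ps, pa, pr, ps2, pdone, alpha, gamma)
            s = s2
            if done:
                break
    return q


q_table = dyna_q()
print(q_table[0])
```

Setting `planning_steps=0` reduces this to plain Q-learning, which makes it easy to compare how quickly value propagates back from the goal with and without simulated experience.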

Patch 6

`wrapper_args` is a list of lists, not a list of dicts

Katze2664
