M. Ernestus

Results 35 issues of M. Ernestus

The [Apport](https://wiki.ubuntu.com/Apport) error reporting mechanism of Ubuntu as well as PyCharm register custom `sys.excepthook`s. This causes the warning: ``` trio/_core/_multierror.py:472: RuntimeWarning: You seem to already have a custom sys.excepthook handler...

exception handling

### 🐛 Bug When cloudpickle fails to deserialize an object, [`json_to_data`](https://github.com/DLR-RM/stable-baselines3/blob/54bcfa4544315fc920be0944fc380fd75e2f7c4a/stable_baselines3/common/save_util.py#L130) prints a warning (fine) but then replaces that object with any other object that just has been parsed before....

bug

I just got my Watchy and right after flashing the first example, the buttons become unresponsive (I can't enter the menu) and the screen seems to update every second instead...

### What is an issue? The documented go-to solution to do distributed optimization for `n` trials independent of the number of workers is to use the [MaxTrialsCallback](https://optuna.readthedocs.io/en/stable/reference/generated/optuna.study.MaxTrialsCallback.html?highlight=MaxTrialsCallback). There are a...

document
contribution-welcome

This is a draft until https://github.com/DLR-RM/rl-baselines3-zoo/pull/257 is resolved.

The expert demonstrations are referred in the code base as: - episodes - trajectories - lists of transitions - rollouts - demonstrations I think we should clarify in the documentation...

I still have a mild concern about the increase in test suite runtime, but this is something we can address in another PR if it proves sufficiently annoying. _Originally posted...

## Description This PR introduces hypothesis tests for BC and does some general cleanup such as introducing a GIVEN, WHEN, THEN structure where appropriate and renaming tests to make it...

When I run the pycharm code inspections, I get 8 errors, 177 warnings, 412 weak warnings and 3325 typos. This includes 77 duplicated code fragments and the following python specific...

enhancement

See the `feature/fix_lstm` branch for [a test](https://github.com/hill-a/stable-baselines/blob/feature/fix_lstm/tests/test_lstm_policy.py) which [fails](https://travis-ci.com/hill-a/stable-baselines/builds/95414555) for the above mentioned algorithms. For PPO1 and TRPO the cause seems to be that the batch size is not provided...

bug
help wanted