M. Ernestus issues

Results 35 issues of


                                            M. Ernestus

Trio and PyCharm fighting over sys.excepthook

The [Apport](https://wiki.ubuntu.com/Apport) error reporting mechanism of Ubuntu as well as PyCharm register custom `sys.excepthook`s. This causes the warning: ``` trio/_core/_multierror.py:472: RuntimeWarning: You seem to already have a custom sys.excepthook handler...

exception handling

[Bug] in json_to_data when cloudpickle fails

### 🐛 Bug When cloudpickle fails to deserialize an object, [`json_to_data`](https://github.com/DLR-RM/stable-baselines3/blob/54bcfa4544315fc920be0944fc380fd75e2f7c4a/stable_baselines3/common/save_util.py#L130) prints a warning (fine) but then replaces that object with any other object that just has been parsed before....

bug

Buttons stop working and screen flashes in 1s intervals after flashing examples

I just got my Watchy and right after flashing the first example, the buttons become unresponsive (I can't enter the menu) and the screen seems to update every second instead...

Gotchas with the MaxTrialsCallback

### What is an issue? The documented go-to solution to do distributed optimization for `n` trials independent of the number of workers is to use the [MaxTrialsCallback](https://optuna.readthedocs.io/en/stable/reference/generated/optuna.study.MaxTrialsCallback.html?highlight=MaxTrialsCallback). There are a...

document

contribution-welcome

Load expert models for testing from huggingface hub

This is a draft until https://github.com/DLR-RM/rl-baselines3-zoo/pull/257 is resolved.

Inconsistent naming of expert demonstrations

The expert demonstrations are referred in the code base as: - episodes - trajectories - lists of transitions - rollouts - demonstrations I think we should clarify in the documentation...

Test suite speed improvements

I still have a mild concern about the increase in test suite runtime, but this is something we can address in another PR if it proves sufficiently annoying. _Originally posted...

Introduce hypothesis testing to BC tests

## Description This PR introduces hypothesis tests for BC and does some general cleanup such as introducing a GIVEN, WHEN, THEN structure where appropriate and renaming tests to make it...

Code Cleanup

When I run the pycharm code inspections, I get 8 errors, 177 warnings, 412 weak warnings and 3325 typos. This includes 77 duplicated code fragments and the following python specific...

enhancement

LSTM policies are broken for PPO1 and TRPO

See the `feature/fix_lstm` branch for [a test](https://github.com/hill-a/stable-baselines/blob/feature/fix_lstm/tests/test_lstm_policy.py) which [fails](https://travis-ci.com/hill-a/stable-baselines/builds/95414555) for the above mentioned algorithms. For PPO1 and TRPO the cause seems to be that the batch size is not provided...

bug

help wanted