rl-baselines3-zoo
rl-baselines3-zoo copied to clipboard
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
### ❓ Question Why is the number of parallel environments (n_envs) set to one when using the optimize-hyperparameters? https://github.com/DLR-RM/rl-baselines3-zoo/blob/e06914e9835b8f3233b18d59943b1464b89ddb90/rl_zoo3/exp_manager.py#L198 Thanks! ### Checklist - [X] I have checked that there is...
## Description improve doc adding "--gym-packages" argument source: https://github.com/DLR-RM/rl-baselines3-zoo/blob/e06914e9835b8f3233b18d59943b1464b89ddb90/rl_zoo3/train.py#L107-L112 ## Motivation and Context - [ ] I have raised an issue to propose this change ([required](https://github.com/DLR-RM/stable-baselines3/blob/master/CONTRIBUTING.md) for new features and...
### 🐛 Bug The [help](https://github.com/DLR-RM/rl-baselines3-zoo/blob/e06914e9835b8f3233b18d59943b1464b89ddb90/rl_zoo3/train.py#L44) for `--optimization-log-path` claims: _"Path to save the **evaluation log** and **optimal policy** for each hyperparameter tried during optimization. "_ However, when i run optimization with...
### 🐛 Bug I am encountering an issue when trying to train my donkeycar simulator agent using the train.py script from rl-baselines3-zoo. While I can successfully import and call the...
### ❓ Question Hi, I noticed that the algorithm discount factor and reward discount factor are set to be the same in lines 365-367 in rl_zoo3/exp_manager.py `# Use the same...
### ❓ Question I trained my custom env with tqc algorithm. However, the trained agent keeps reset every 1000 episodes even though I did not set number of episode for...
## Description Added hyperparameters for SB3 CrossQ. ## Motivation and Context - [ ] I have raised an issue to propose this change ([required](https://github.com/DLR-RM/stable-baselines3/blob/master/CONTRIBUTING.md) for new features and bug fixes)...
# Remove deprecated use of 'Repository' when pushing to huggingface hub ## Description As mentioned above, use of the `Repository` class is deprecated: . ## Motivation and Context - [x]...
### 🚀 Feature I got various errors uploading my models to huggingface_hub (didn't include this as a bug report as even the proposed commit that should have fixed my issue...
### ❓ Question Hello, I first optimize A2C on 1mln steps using RL Baselines3 Zoo: Firstly i have changed `a2c.yml` in RL Baselines3 Zoo to work with RAM version of...