rl-baselines3-zoo
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
## Description ## Motivation and Context - [ ] I have raised an issue to propose this change ([required](https://github.com/DLR-RM/stable-baselines3/blob/master/CONTRIBUTING.md) for new features and bug fixes) ## Types of changes -...
It would be nice to have a CLI that is available everywhere (with completion too), something like:
```bash
sb3 train --algo ppo --env CartPole-v1
sb3 train --algo ppo --env CartPole-v1...
```
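A minimal sketch of how such an entry point could be wired up, assuming a hypothetical `sb3` console script with a `train` subcommand (the names and the setuptools registration are illustrative, not an existing interface):

```python
# Hypothetical `sb3` console-script entry point, e.g. registered via
# setuptools entry_points as "sb3 = rl_zoo.cli:main". Not the project's
# actual CLI; just a sketch of the proposed interface.
import argparse


def main() -> None:
    parser = argparse.ArgumentParser(prog="sb3")
    subparsers = parser.add_subparsers(dest="command", required=True)

    # `sb3 train --algo ppo --env CartPole-v1`
    train = subparsers.add_parser("train", help="Train an agent")
    train.add_argument("--algo", required=True, help="RL algorithm, e.g. ppo")
    train.add_argument("--env", required=True, help="Gym environment id")

    args = parser.parse_args()
    if args.command == "train":
        # Placeholder for the call into the zoo's training code.
        print(f"Training {args.algo} on {args.env}")


if __name__ == "__main__":
    main()
```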
### Question Is it possible to pass parameters to a validation environment? ### Additional context I'm using rl-zoo to tune a DDPG agent on a custom environment. From the command...
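One generic way to give the evaluation environment its own parameters, outside the zoo scripts, is to build it separately and hand it to SB3's `EvalCallback`. A sketch under assumed names (`MyCustomEnv-v0` and its `difficulty` kwarg are placeholders for the custom environment):

```python
# Sketch: build a separate evaluation env with its own constructor kwargs
# and pass it to SB3's EvalCallback. "MyCustomEnv-v0" and the "difficulty"
# kwarg are placeholders for a custom environment, not part of the zoo.
import gym
from stable_baselines3 import DDPG
from stable_baselines3.common.callbacks import EvalCallback
from stable_baselines3.common.monitor import Monitor

train_env = gym.make("MyCustomEnv-v0", difficulty=1.0)
# Different constructor parameters for the validation environment.
eval_env = Monitor(gym.make("MyCustomEnv-v0", difficulty=2.0))

model = DDPG("MlpPolicy", train_env)
eval_callback = EvalCallback(eval_env, eval_freq=10_000, n_eval_episodes=5)
model.learn(total_timesteps=100_000, callback=eval_callback)
```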
Right now, only hyperparameters that are searched by default can have their params dict copied and reused, due to naming issues. This should be extended to hyperparameters that are...
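For context, the samplers follow roughly this pattern (a simplified sketch, not the project's exact code): each sampling function calls `trial.suggest_*` and returns a plain dict of keyword arguments, so the Optuna parameter names and the dict keys have to line up with the algorithm's constructor arguments for the dict to be copied and reused.

```python
# Simplified sketch of an Optuna sampler in this style (not the exact zoo
# code): the returned dict is passed as keyword arguments to the algorithm,
# so the keys must match the constructor's parameter names.
import optuna


def sample_ppo_params(trial: optuna.Trial) -> dict:
    return {
        "learning_rate": trial.suggest_float("learning_rate", 1e-5, 1e-3, log=True),
        "gamma": trial.suggest_float("gamma", 0.9, 0.9999),
        "n_steps": trial.suggest_categorical("n_steps", [256, 512, 1024, 2048]),
    }
```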
Running the `make type` script (for PR #140) results in an error that I am not able to decipher and that does not seem likely to be my fault:
```
justinterry@preacherMan2 rl-baselines3-zoo...
```
Building off the JSONs of the best hyperparameters saved in https://github.com/DLR-RM/rl-baselines3-zoo/pull/140, I have a script that takes each one and trains it 10 times (by default) to determine which...
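A rough outline of what such a script can look like (a sketch under an assumed file layout; the `run_training` helper is hypothetical and stands in for an actual training run):

```python
# Sketch: rank saved best-hyperparameter JSONs by training each one several
# times and comparing mean scores. The file layout and the run_training
# helper are hypothetical placeholders.
import json
import statistics
from pathlib import Path


def run_training(params: dict, seed: int) -> float:
    """Hypothetical helper: train one run with the given params and return its score."""
    raise NotImplementedError


def rank_candidates(json_dir: str, n_runs: int = 10) -> list:
    results = []
    for path in sorted(Path(json_dir).glob("*.json")):
        params = json.loads(path.read_text())
        scores = [run_training(params, seed) for seed in range(n_runs)]
        results.append((statistics.mean(scores), path.name))
    # Highest mean score first.
    return sorted(results, reverse=True)
```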
In my trials tuning both PettingZoo and Gym environments, all Optuna jobs on one node have simply stopped, twice now. There have been no system errors, nothing printed to stdout,...
Enabling the option to run `--optimize` together with the `--trained-agent` flag would be great! In my case, I pre-trained an agent on a simplified task and want to continue training...
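Until that is supported, a rough workaround outside the zoo is to load the pre-trained agent inside the Optuna objective and continue training it there. A sketch only; the checkpoint path, environment, and the single tuned hyperparameter are placeholders:

```python
# Sketch of a workaround: continue training a pre-trained agent inside an
# Optuna objective. The checkpoint path, env id, and tuned hyperparameter
# are placeholders; this is not an existing zoo feature.
import gym
import optuna
from stable_baselines3 import PPO
from stable_baselines3.common.evaluation import evaluate_policy


def objective(trial: optuna.Trial) -> float:
    lr = trial.suggest_float("learning_rate", 1e-5, 1e-3, log=True)
    env = gym.make("CartPole-v1")
    # custom_objects replaces the stored value when the checkpoint is loaded.
    model = PPO.load("path/to/pretrained_agent.zip", env=env,
                     custom_objects={"learning_rate": lr})
    model.learn(total_timesteps=50_000)
    mean_reward, _ = evaluate_policy(model, env, n_eval_episodes=10)
    return mean_reward


study = optuna.create_study(direction="maximize")
study.optimize(objective, n_trials=20)
```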
I needed to do a run with PPO on a Gym environment on my cluster to make sure everything is working correctly before moving on to tuning PettingZoo environments, so I...
See https://optuna.readthedocs.io/en/latest/reference/generated/optuna.study.Study.html#optuna.study.Study
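For reference, a short sketch of the `Study` API that the link points to (the storage URL and study name are placeholders):

```python
# Sketch: inspect an Optuna study through the Study API.
# The storage URL and study name are placeholders.
import optuna

study = optuna.load_study(
    study_name="ppo-cartpole",
    storage="sqlite:///optuna.db",
)
print(study.best_trial.params)  # best hyperparameters found so far
print(study.best_value)         # corresponding objective value
df = study.trials_dataframe()   # all trials as a pandas DataFrame
print(df.head())
```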