Antonin RAFFIN comments

Results 880 comments of


                                            Antonin RAFFIN

[Question] Plotting Continued Models on the Same Line

> Will these methods work with all_plots.py as well? I just wanted to make sure for plotting training and evaluation. For evaluation, you will need to merge the `evaluations.npz` files....

[Question] Plotting Continued Models on the Same Line

> made a Google Colab notebook to perform this task. could you share the link as it might be useful for others? > it appears evaluations.npz is a dictionary where...

[Question] Plotting Continued Models on the Same Line

> For evaluations.npz, is there a way to get all the keys? `.keys()`?

[Question] Plotting Continued Models on the Same Line

> What are the differences between these model files? one is a checkpoint, this other is saved at the end of training, the last one is the best model according...

[Question] The performance for Hopper-v3 doesn't get converged for PPO

> Sorry, what more information is needed? The hyperparameters used and your system/lib information (os, gym version, mujoco version, sb3 version, ...)

Adding Bayesian optimization with BOTorch instead of TPE with Optuna

Hello, TPE is already kind of doing bayesian optimization, no? (predict the outcome for a given set of parameters and provide uncertainy). GP is already available here: https://github.com/DLR-RM/rl-baselines3-zoo/blob/master/rl_zoo3/exp_manager.py#L685-L691 > In...

Antonin RAFFIN

[Question] Plotting Continued Models on the Same Line

[Question] Plotting Continued Models on the Same Line

[Question] Plotting Continued Models on the Same Line

[Question] Plotting Continued Models on the Same Line

[Question] The performance for Hopper-v3 doesn't get converged for PPO

Adding Bayesian optimization with BOTorch instead of TPE with Optuna

Adding Bayesian optimization with BOTorch instead of TPE with Optuna

[Question] Plotting Smoothed Evaluation Curve

[Feature Request] Support Stochastic Weight Averaging (SWA) for improved stability

[Question] Inconsistent training of Panda manipulation tasks