rl-baselines3-zoo
rl-baselines3-zoo copied to clipboard
panda-gym v2
Description
Update benchmark for latest version of panda-gym.
Motivation and Context
- [ ] I have raised an issue to propose this change (required for new features and bug fixes)
Types of changes
- [ ] Bug fix (non-breaking change which fixes an issue)
- [ ] New feature (non-breaking change which adds functionality)
- [x] Breaking change (fix or feature that would cause existing functionality to change)
- [ ] Documentation (update in the documentation)
Checklist:
- [ ] I've read the CONTRIBUTION guide (required)
- [x] I have updated the changelog accordingly (required).
- [x] My change requires a change to the documentation.
- [ ] I have updated the tests accordingly (required for a bug fix or a new feature).
- [x] I have updated the documentation accordingly.
- [x] I have reformatted the code using
make format
(required) - [x] I have checked the codestyle using
make check-codestyle
andmake lint
(required) - [x] I have ensured
make pytest
andmake type
both pass. (required)
Note: we are using a maximum length of 127 characters per line
Hey,
thanks for the PR =) I might wait until Huggingface PR is merged https://github.com/DLR-RM/rl-baselines3-zoo/pull/198 I will help @simoninithomas in the coming days.
Quickly looking at it, no additional tuning was in fact required?
no additional tuning was in fact required?
No tuning has been done. It might be possible to find better hyperparameters. I leave this for a future PR. One exception: I had to decrease the learning for PickAndPlace with SAC. Otherwise, the loss was diverging.
Basic version is ready to review ;) https://github.com/DLR-RM/rl-baselines3-zoo/pull/250
An example: https://huggingface.co/sb3/ppo-MountainCarContinuous-v0
btw, this requires gym > 0.21, right?
btw, this requires gym > 0.21, right?
Yes, so it can't be merged before #256 I think
What's the status with this PR?
panda-gym v2 requires gym==0.24. But, for the moment, SB3 requires gym==0.21. So the status is "waiting for SB3 to be compatible with gym>=0.24"