gym-cartpole-swingup
chore(deps): bump gym from 0.18.3 to 0.26.2
Bumps gym from 0.18.3 to 0.26.2.
Release notes
Sourced from gym's releases.
0.26.2
Release notes
This is another very minor bug release.
Bug Fixes
- As reset now returns (obs, info), the final step's info was overwritten in the vector environments. Now, the final observation and info are contained within the info as "final_observation" and "final_info". @pseudo-rnd-thoughts
- Adds warnings when trying to render without specifying the render_mode. @younik
- Updates Atari Preprocessing such that the wrapper can be pickled. @vermouth1992
- GitHub CI was hardened such that the CI just has read permissions. @sashashura
- Clarify and fix typo in GraphInstance. @ekalosak
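As a rough illustration of the first fix above, the sketch below mimics, in plain Python and for a single sub-environment, how an auto-resetting step can keep the last observation and info of a finished episode under "final_observation" and "final_info". CountdownEnv and autoreset_step are hypothetical names invented for this example. This is not gym's vector-environment code, only a toy showing the idea.

```python
class CountdownEnv:
    """Hypothetical toy environment using the 0.26 reset/step signatures."""

    def __init__(self, start=2):
        self.start = start
        self.state = start

    def reset(self, seed=None, options=None):
        self.state = self.start
        return self.state, {"episode_start": True}

    def step(self, action):
        self.state -= 1
        terminated = self.state == 0
        info = {"steps_left": self.state}
        return self.state, 1.0, terminated, False, info


def autoreset_step(env, action):
    """Step env; on episode end, reset it and keep the final obs/info in info."""
    obs, reward, terminated, truncated, info = env.step(action)
    if terminated or truncated:
        final_obs, final_info = obs, info
        obs, info = env.reset()
        # Copy so the dict returned by reset() is not mutated in place.
        info = dict(info)
        info["final_observation"] = final_obs
        info["final_info"] = final_info
    return obs, reward, terminated, truncated, info


env = CountdownEnv(start=2)
env.reset()
first = autoreset_step(env, action=0)   # episode still running
second = autoreset_step(env, action=0)  # episode ends, env is reset
```

After the second call, the returned observation belongs to the new episode, while the terminal observation and info remain reachable through the info dictionary.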
0.26.1
Release Notes
This is a very minor bug fix release for 0.26.0
Bug Fixes
- #3072 - Previously mujoco was a necessary module even if only mujoco-py was used. This has been fixed to allow only mujoco-py to be installed and used. @YouJiacheng
- #3076 - PixelObservationWrapper raises an exception if the env.render_mode is not specified. @vmoens
- #3080 - Fixed bug in CarRacing where the colour of the wheels was not correct. @foxik
- #3083 - Fixed BipedalWalker where if the agent moved backwards then the rendered arrays would be a different size. @younik
Spelling
- Fixed truncation typo in readme API example. @rdnfn
- Updated pendulum observation space from angle to theta to make more consistent. @ikamensh
0.26.0
Release notes for v0.26.0
This release is aimed to be the last of the major API changes to the core API. All of the previously "turned off" changes of the base API (step termination / truncation, reset info, no seed function, render mode determined by initialization) are now expected by default. We still plan to make breaking changes to Gym itself, but to things that are very easy to upgrade (environments and wrappers), and things that aren't super commonly used (the vector API). Once those aspects are stabilized, we'll do a proper 1.0 release and follow semantic versioning. Additionally, unless something goes terribly wrong with this release and we have to release a patched version, this will be the last release of Gym for a while.
If you've been waiting for a "stable" release of Gym to upgrade your project given all the changes that have been going on, this is the one.
We also just wanted to say that we tremendously appreciate the community's patience with us as we've gone on this journey taking over the maintenance of Gym and making all of these huge changes to the core API. We appreciate your patience and support, but hopefully, all the changes from here on out will be much more minor.
Breaking backward compatibility
These changes are true of all gym's internal wrappers and environments, but for environments not updated, we provide the EnvCompatibility wrapper for users to convert old gym v21 / 22 environments to the new core API. This wrapper can be easily applied in gym.make and gym.register through the apply_api_compatibility parameters.
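To make the conversion concrete, here is a minimal sketch of what such a compatibility layer has to do, written in plain Python without importing gym. OldStyleEnv and NewApiAdapter are hypothetical names for this example, and reading truncation from a "TimeLimit.truncated" info key is an assumption of the sketch. It is not the EnvCompatibility implementation.

```python
class OldStyleEnv:
    """Hypothetical pre-0.26 environment: reset() -> obs, step() -> 4 values."""

    def __init__(self):
        self.t = 0
        self.last_seed = None

    def seed(self, seed=None):
        self.last_seed = seed

    def reset(self):
        self.t = 0
        return 0.0

    def step(self, action):
        self.t += 1
        done = self.t >= 3
        # Assumption: a time-limit cutoff is flagged through this info key.
        info = {"TimeLimit.truncated": done}
        return float(self.t), 1.0, done, info


class NewApiAdapter:
    """Hypothetical adapter exposing the 5-value step and (obs, info) reset."""

    def __init__(self, old_env):
        self.old_env = old_env

    def reset(self, seed=None, options=None):
        if seed is not None:
            self.old_env.seed(seed)  # forward the seed to the old seed() method
        return self.old_env.reset(), {}

    def step(self, action):
        obs, reward, done, info = self.old_env.step(action)
        truncated = bool(info.get("TimeLimit.truncated", False))
        terminated = done and not truncated
        return obs, reward, terminated, truncated, info


adapted = NewApiAdapter(OldStyleEnv())
reset_result = adapted.reset(seed=42)
results = [adapted.step(0) for _ in range(3)]
```

The adapter splits the single done flag into terminated and truncated, and routes the seed through reset, which are the two signature changes a custom environment such as this repository's would need to absorb.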
- Step termination / truncation - The Env.step function returns 5 values instead of the previous 4 (observations, reward, termination, truncation, info). A blog with more details will be released soon to explain this decision. @arjun-kg
- Reset info - The Env.reset function returns two values (obs and info) with no return_info parameter for gym wrappers and environments. This is important for some environments that provide action masking information for each action, which was not possible for resets. @balisujohn
- No Seed function - While Env.seed was a helpful function, it was almost solely used at the beginning of the episode, and seeding is now part of Env.reset(seed=...). In addition, for several environments like Atari that utilise external random number generators, it was not possible to set the seed at any time other than reset. Therefore, seed is no longer expected to function within gym environments and is removed from all gym environments. @balisujohn
- Rendering - It is normal to only use a single render mode, and to help open and close the rendering window, we have changed Env.render to not take any arguments, so all render arguments can be part of the environment's constructor, i.e., gym.make("CartPole-v1", render_mode="human"). For more detail on the new API, see the blog post. @younik
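The four changes above can be sketched together in one toy environment and episode loop. RandomWalkEnv and run_episode are hypothetical names for this example, and the code deliberately avoids importing gym so that it only illustrates the call signatures described in the release notes.

```python
import random


class RandomWalkEnv:
    """Hypothetical toy environment written against the 0.26 core API."""

    def __init__(self, render_mode=None, max_steps=5):
        self.render_mode = render_mode  # chosen once, at construction
        self.max_steps = max_steps
        self.rng = random.Random()
        self.pos = 0
        self.t = 0

    def reset(self, seed=None, options=None):
        if seed is not None:
            self.rng = random.Random(seed)  # seeding happens only in reset
        self.pos, self.t = 0, 0
        return self.pos, {}

    def step(self, action):
        self.pos += action + self.rng.choice([-1, 0, 1])
        self.t += 1
        terminated = abs(self.pos) >= 3       # reached a terminal state
        truncated = self.t >= self.max_steps  # hit the step limit
        return self.pos, -1.0, terminated, truncated, {}

    def render(self):
        # No arguments: the mode was fixed in the constructor.
        if self.render_mode == "ansi":
            return f"pos={self.pos}"
        return None


def run_episode(env, seed):
    """Run one episode with a constant action and return visited positions."""
    obs, info = env.reset(seed=seed)
    trajectory = [obs]
    terminated = truncated = False
    while not (terminated or truncated):
        obs, reward, terminated, truncated, info = env.step(1)
        trajectory.append(obs)
    return trajectory


env = RandomWalkEnv(render_mode="ansi")
first_run = run_episode(env, seed=0)
second_run = run_episode(env, seed=0)
```

Because the seed is passed to reset, repeating the call with the same seed reproduces the same trajectory, and the loop ends on either terminated or truncated rather than a single done flag.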
Major changes
- Render modes - In v25, there was a change in the meaning of render modes, i.e. "rgb_array" returned a list of rendered frames while "single_rgb_array" returned a single frame. This has been reverted in this release, with "rgb_array" having the same meaning as previously (returning a single frame) and a new mode "rgb_array_list" returning a list of RGB arrays. The capability to return a list of rendering observations is achieved through a wrapper applied during gym.make. #3040 @pseudo-rnd-thoughts @younik
- Added save_video that uses moviepy to render a list of RGB frames and updated RecordVideo to use this function. This removes support for recording ansi outputs. #3016 @younik
- RandomNumberGenerator functions: rand, randn, randint, get_state, set_state, hash_seed, create_seed, _bigint_from_bytes and _int_list_from_bigint have been removed. @balisujohn
- Bump ale-py to 0.8.0, which is compatible with the new core API
- Added EnvAPICompatibility wrapper @RedTachyon
... (truncated)
Commits
- a368cfa Bump to V26.2 (#3116)
- 780e884 ENH: add render warn for None (#3112)
- 1486d33 Fix reset info being lost in vector environments (#3111)
- 21e6e27 Update atari_preprocessing.py (#3105)
- 2ac7b1c clarified and fixed typo in GraphInstance docstring (#3100)
- 4544bb9 Update README.md
- e055ee7 GitHub Workflows security hardening (#3094)
- 53d784e Bump version to v26.1 (#3090)
- 94aaff9 Update pendulum.py (#3086)
- af14749 Update the CarRacing main method to the new rendering API. (#3085)
- Additional commits viewable in compare view
Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.
Dependabot commands and options
You can trigger Dependabot actions by commenting on this PR:
- @dependabot rebase will rebase this PR
- @dependabot recreate will recreate this PR, overwriting any edits that have been made to it
- @dependabot merge will merge this PR after your CI passes on it
- @dependabot squash and merge will squash and merge this PR after your CI passes on it
- @dependabot cancel merge will cancel a previously requested merge and block automerging
- @dependabot reopen will reopen this PR if it is closed
- @dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
- @dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
- @dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
- @dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)