RL Examples had bugs on current gym version

Open sanggusti opened this issue 1 year ago • 0 comments

Context

Pytorch version:
Operating System and version: Ubuntu 20

Your Environment

Installed using source? [yes/no]:
Are you planning to deploy it using docker container? [yes/no]:
Is it a CPU or GPU environment?:
Which example are you using: reinforcement_learning
Link to code or data to repro [if any]:

Expected Behavior

The example scripts (reinforce.py and actor_critic.py) should run to completion without raising errors.

Current Behavior

Running either script (reinforce.py or actor_critic.py) raises the following error:

---------------------------------------------------------------------------
ValueError                                Traceback (most recent call last)
<ipython-input-8-263240bbee7e> in <cell line: 1>()
----> 1 main()

<ipython-input-4-6af08085b221> in main()
     87     running_reward = 10
     88     for i_episode in count(1):
---> 89         state, _ = env.reset()
     90         ep_reward = 0
     91         for t in range(1, 10000):  # Don't infinite loop while learning

ValueError: too many values to unpack (expected 2)
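
A likely explanation, stated as an assumption rather than a confirmed diagnosis: the scripts are written for the gym 0.26+ API, where `reset()` returns an `(observation, info)` pair, while gym 0.25.2 returns the observation alone by default. A four-element observation then gets unpacked into two names. The sketch below reproduces the same kind of failure without gym installed; `OldStyleEnv` and `try_unpack` are hypothetical names used only for illustration.

```python
class OldStyleEnv:
    """Hypothetical stand-in for a pre-0.26 gym env: reset() returns only the observation."""

    def reset(self):
        # Four floats, mimicking a CartPole-style observation (assumed shape).
        return [0.01, -0.02, 0.03, 0.04]


def try_unpack(env):
    """Attempt the two-name unpacking from the traceback and report the outcome."""
    try:
        state, _ = env.reset()
    except ValueError as exc:
        # The observation has four elements, so it cannot fill exactly two names.
        return str(exc)
    return "ok"


message = try_unpack(OldStyleEnv())
print(message)
```

The printed message should match the `ValueError` in the traceback above, although the exact wording can differ slightly between Python versions.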

Possible Solution

I have opened a pull request with changes that run on my system (gym version 0.25.2): https://github.com/pytorch/examples/pull/1212
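
For illustration only, and not necessarily what the pull request does, one way to make the scripts tolerate both API styles is to wrap `reset()` and `step()` in small helpers. The sketch assumes that gym 0.26+ returns `(observation, info)` from `reset()` and a five-element tuple from `step()`, while gym 0.25.x returns the bare observation and a four-element tuple by default. `reset_env` and `step_env` are hypothetical helper names, not part of gym or of the example scripts.

```python
def reset_env(env):
    """Return only the observation from env.reset() under either gym API."""
    result = env.reset()
    # Assumption: gym >= 0.26 returns (observation, info) with info as a dict,
    # while gym 0.25.x returns the bare observation by default.
    if isinstance(result, tuple) and len(result) == 2 and isinstance(result[1], dict):
        return result[0]
    return result


def step_env(env, action):
    """Return (observation, reward, done) from env.step() under either gym API."""
    result = env.step(action)
    if len(result) == 5:
        # Newer API (assumed): (observation, reward, terminated, truncated, info).
        obs, reward, terminated, truncated, _info = result
        return obs, reward, bool(terminated or truncated)
    # Older API (assumed): (observation, reward, done, info).
    obs, reward, done, _info = result
    return obs, reward, bool(done)
```

With these helpers, the failing line in `main()` would become `state = reset_env(env)`, and each step would read `state, reward, done = step_env(env, action)`. Pinning a single gym version in the example's requirements would be a simpler alternative if supporting both APIs is not a goal.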

Steps to Reproduce

Go to the reinforcement_learning folder.
Run actor_critic.py or reinforce.py with gym version 0.25.2 ...

Failure Logs [if any]

### Tasks
- [ ] https://github.com/pytorch/examples/pull/1212

Jan 10 '24 13:01 sanggusti
