genrl issues

HER Wrappers

10

Wrt #171: added a `HERTrainer`, a `HERGoalEnvWrapper`, and a `HERWrapper` for the replay buffer. The tests may need to be moved to a different location; I wasn't too sure of where...

hades-rp2010

Blocked
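For readers unfamiliar with the idea behind these wrappers, hindsight relabeling with the "final" strategy can be sketched as follows. This is a minimal, framework-free illustration and not the code from the PR: `Transition`, `sparse_reward`, and `relabel_final` are hypothetical names, and the sketch assumes the achieved goal is simply the next state.

```python
from typing import Callable, List, NamedTuple, Tuple

Vector = Tuple[float, ...]


class Transition(NamedTuple):
    """Hypothetical goal-conditioned transition (not a genrl type)."""
    state: Vector
    action: int
    next_state: Vector  # assumption: the achieved goal equals the next state
    goal: Vector
    reward: float


def sparse_reward(achieved: Vector, goal: Vector) -> float:
    """Return 0 when the goal is reached and -1 otherwise."""
    return 0.0 if achieved == goal else -1.0


def relabel_final(
    episode: List[Transition],
    reward_fn: Callable[[Vector, Vector], float] = sparse_reward,
) -> List[Transition]:
    """Relabel every transition with the goal actually reached at episode end."""
    new_goal = episode[-1].next_state
    return [
        t._replace(goal=new_goal, reward=reward_fn(t.next_state, new_goal))
        for t in episode
    ]
```

In a typical setup the relabeled transitions are stored in the replay buffer alongside the originals, so the agent sees at least some non-negative rewards even when it never reaches the intended goal.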

[WIP] Added BCQ

Stuff implemented:
- Added BCQ under genrl/agents/offline
- BCQ inherits from `OffPolicyAgentAC`. Architecture was very similar to TD3. Major differences were that the actor took in both state and action...

sampreet-arthi

A2C and VPG

1

Wrt #375: made some changes in discount.py (really silly mistakes). Now the two agents are training.

hades-rp2010
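For context, the kind of computation a discounting helper performs can be sketched as below. This is an illustrative, framework-free version; the name `discounted_returns` and its signature are assumptions and do not reflect the actual contents of discount.py.

```python
from typing import List, Sequence


def discounted_returns(
    rewards: Sequence[float],
    dones: Sequence[bool],
    gamma: float = 0.99,
) -> List[float]:
    """Compute the discounted reward-to-go for each step of a rollout.

    `dones[t]` is True when step t ended an episode, so rewards from
    later steps must not leak into the return of step t.
    """
    returns = [0.0] * len(rewards)
    running = 0.0
    # Walk backwards so each return reuses the one computed after it.
    for t in reversed(range(len(rewards))):
        if dones[t]:
            running = 0.0  # episode boundary: drop the accumulated future return
        running = rewards[t] + gamma * running
        returns[t] = running
    return returns
```

Forgetting the reset at episode boundaries, or iterating forwards instead of backwards, are common ways for this step to go wrong in policy-gradient agents such as A2C and VPG.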

More comprehensive unit testing

Some parts of our code seem fragile and might fail easily. I suggest adding more unit tests for the following:
- Custom agents (there's only VPG and PPO...

sampreet-arthi

enhancement

good first issue

CI/CD

PPO1, A2C, VPG, DQN not training for Atari envs

5

DQN is also not training but that'll be addressed after DQN is restructured.

sampreet-arthi

Categorical DQN not training

7

There might be some shape-related errors, or we're missing something. Either that, or the hyperparameters need to be tuned.

sampreet-arthi

bug

Algorithms

Updating docstrings

6

We're moving from the current docstring style to the Google docstring style. Please refer to [DQN](https://github.com/SforAiDl/genrl/blob/master/genrl/deep/agents/dqn/base.py) and [this](https://sphinxcontrib-napoleon.readthedocs.io/en/latest/example_google.html) for an idea. This is a pretty long issue and pretty important...

sampreet-arthi

documentation

good first issue
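As a rough illustration of the target style, here is a minimal sketch of a Google-style docstring on a hypothetical helper. The function `linear_epsilon` and its parameters are invented for this example and are not part of genrl.

```python
def linear_epsilon(
    step: int,
    start: float = 1.0,
    end: float = 0.01,
    decay_steps: int = 1000,
) -> float:
    """Linearly anneal an exploration rate.

    Args:
        step (int): Current training step, counted from 0.
        start (float): Value returned at step 0.
        end (float): Value returned once ``decay_steps`` is reached.
        decay_steps (int): Number of steps over which to anneal.

    Returns:
        float: The exploration rate for ``step``.

    Raises:
        ValueError: If ``decay_steps`` is not positive.
    """
    if decay_steps <= 0:
        raise ValueError("decay_steps must be positive")
    # Clamp progress to [0, 1] so the rate stays at `end` after annealing.
    fraction = min(max(step, 0) / decay_steps, 1.0)
    return start + fraction * (end - start)
```

The `Args`, `Returns`, and `Raises` sections are the parts that the Napoleon extension for Sphinx parses, so keeping their headings and indentation consistent matters more than the exact wording.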

Code Climate Issues

1

Right now, we have 50+ code smells on Code Climate. A lot of these are pretty hard to avoid. We should keep this as a long-term issue. If...

sampreet-arthi

Priority:Low

Prioritized Replay Buffer Support for Off Policy Agents

2

From #169

Sharad24

enhancement
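To make the request concrete, proportional prioritized sampling can be sketched with a plain list-based buffer. This is only an illustration under stated assumptions: the class `ProportionalReplayBuffer` and its methods are hypothetical and not genrl's API, sampling is O(n) rather than using a sum tree, and importance weights are normalized by the batch maximum.

```python
import random
from typing import Any, List, Sequence, Tuple


class ProportionalReplayBuffer:
    """Hypothetical list-based prioritized buffer (not genrl's implementation)."""

    def __init__(self, capacity: int, alpha: float = 0.6) -> None:
        self.capacity = capacity
        self.alpha = alpha  # 0 gives uniform sampling, 1 gives fully prioritized
        self.items: List[Any] = []
        self.priorities: List[float] = []
        self.next_index = 0

    def push(self, item: Any) -> None:
        # New items get the current maximum priority so they are sampled soon.
        max_priority = max(self.priorities, default=1.0)
        if len(self.items) < self.capacity:
            self.items.append(item)
            self.priorities.append(max_priority)
        else:
            # Buffer is full: overwrite the oldest entry.
            self.items[self.next_index] = item
            self.priorities[self.next_index] = max_priority
        self.next_index = (self.next_index + 1) % self.capacity

    def sample(
        self, batch_size: int, beta: float = 0.4
    ) -> Tuple[List[int], List[Any], List[float]]:
        scaled = [p ** self.alpha for p in self.priorities]
        total = sum(scaled)
        indices = random.choices(range(len(self.items)), weights=scaled, k=batch_size)
        n = len(self.items)
        # Importance-sampling weights correct the bias from non-uniform sampling.
        weights = [(n * scaled[i] / total) ** (-beta) for i in indices]
        max_weight = max(weights)
        weights = [w / max_weight for w in weights]
        return indices, [self.items[i] for i in indices], weights

    def update_priorities(
        self, indices: Sequence[int], td_errors: Sequence[float], eps: float = 1e-6
    ) -> None:
        # Priority is the absolute TD error plus a small constant to keep it positive.
        for i, error in zip(indices, td_errors):
            self.priorities[i] = abs(error) + eps
```

An off-policy agent would call `update_priorities` with the TD errors of the sampled batch after each update, and multiply its per-sample loss by the returned weights.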

Extensibility of Agents

Documentation on how agents can be extended. @sampreet-arthi added a great tutorial that can be used as a reference: #259

Sharad24

documentation

good first issue
