Sampreet
Sampreet
Go to the `docs/source/usage/tutorials` and add separate `.md` files to explain the following: - [x] Using A2C (@Darshan-ko ) - [ ] Using PPO1 - [x] Using VPG (@Devanshu24 )...
Stuff implemented: - Added BCQ under genrl/agents/offline - BCQ inherits from `OffPolicyAgentAC`. Architecture was very similar to TD3. Major differences were that the actor took in both state and action...
There seem to be some vulnerabilities in our code that might fail easily. I suggest adding more unit tests for the following: - Custom agents (there's only VPG and PPO...
DQN is also not training but that'll be addressed after DQN is restructured.
There might be some shape related errors or we're missing something. Either that or hyperparameters need to be tuned.
We're moving from the current docstring style to the Google docstring style. Please refer to [DQN](https://github.com/SforAiDl/genrl/blob/master/genrl/deep/agents/dqn/base.py) and [this](https://sphinxcontrib-napoleon.readthedocs.io/en/latest/example_google.html) for an idea. This is a pretty long issue and pretty important...
Right now, we have about 50+ code smells on Code Climate. A lot of these are pretty hard to avoid. We should keep this as a long term issue. If...
We currently have a lot of issues, a lot of which are either incomplete, not relevant for the time being, need to be done at a later period or we...