genrl

A PyTorch reinforcement learning library for generalizable and reproducible algorithm implementations, with the aim of improving accessibility in RL.

Results 63 genrl issues

Add documentation on the types of available env wrappers

documentation
good first issue
Customizability - Environments

@DarylRodrigo pointed out that the current explanations of policy and value functions are not good. One of our aims is to tackle accessibility. Quoting him: "the intro into policy and value...

Accessibility - Tutorials

More explanations of the Bellman equations, their relation to MDPs, and then Q-learning, etc. At the moment, we have almost nothing on the Bellman equations! Thanks @DarylRodrigo

Accessibility - Tutorials
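
As a rough illustration of the kind of material this issue asks for, the link between the Bellman optimality equation and Q-learning can be sketched as a single tabular update. This is not genrl code: the function name `q_learning_update`, the nested-list Q-table, and the default `alpha` and `gamma` values are all assumptions made for the example.

```python
from typing import List


def q_learning_update(
    q: List[List[float]],
    state: int,
    action: int,
    reward: float,
    next_state: int,
    alpha: float = 0.1,   # learning rate (illustrative default)
    gamma: float = 0.99,  # discount factor (illustrative default)
    done: bool = False,
) -> float:
    """Apply one tabular Q-learning step and return the updated Q-value.

    The target is a sampled version of the Bellman optimality equation:
        Q(s, a) = E[r + gamma * max_a' Q(s', a')]
    """
    # No bootstrapping from the next state when the episode has ended.
    best_next = 0.0 if done else max(q[next_state])
    td_target = reward + gamma * best_next
    # Move the current estimate a fraction alpha towards the target.
    q[state][action] += alpha * (td_target - q[state][action])
    return q[state][action]


# Hypothetical two-state, two-action Q-table.
q_table = [[0.0, 0.0], [1.0, 2.0]]
updated = q_learning_update(q_table, state=0, action=1, reward=1.0,
                            next_state=1, alpha=0.5, gamma=0.9)
```

Here the target is 1.0 + 0.9 * 2.0, and the estimate for state 0, action 1 moves halfway towards it.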

Current training is not reproducible (observation based on `deep.py` in `examples`). Something is going wrong, and it is not clear why seeding is not doing the job here. Could be that we're missing...

Priority:High
Core
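
One common cause of this kind of problem is that only some of the random number generators in play are seeded. The sketch below is not taken from genrl; `seed_everything` is a hypothetical helper that seeds Python's `random`, and also NumPy and PyTorch when they are installed. It does not seed the environment itself, which usually needs its own seed, and it may not be the cause of the behaviour reported here.

```python
import random


def seed_everything(seed: int) -> None:
    """Seed the RNGs that typically affect an RL training run.

    Hypothetical helper: NumPy and PyTorch are seeded only if importable.
    """
    random.seed(seed)

    try:
        import numpy as np
        np.random.seed(seed)
    except ImportError:
        pass

    try:
        import torch
        torch.manual_seed(seed)
        # Ask cuDNN for deterministic kernels; this can slow training down.
        torch.backends.cudnn.deterministic = True
        torch.backends.cudnn.benchmark = False
    except ImportError:
        pass


# Two runs started from the same seed should draw identical numbers.
seed_everything(0)
first_run = [random.random() for _ in range(3)]
seed_everything(0)
second_run = [random.random() for _ in range(3)]
```

If two runs still diverge after a call like this, the remaining suspects are usually the environment's own seed, unseeded action-space sampling, or nondeterministic GPU operations.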

We currently have a lot of issues, a lot of which are either incomplete, not relevant for the time being, need to be done at a later period or we...

I have been playing around with the DCBTrainer and found some potential inconsistencies. 1) **StatlogData** example found [here](https://genrl.readthedocs.io/en/latest/usage/tutorials/bandit/contextual_overview.html) ``` from genrl.utils import StatlogDataBandit bandit = StatlogDataBandit(download=True) context = bandit.reset() from...

documentation
good first issue

Should we shift to using [DM Env](https://github.com/deepmind/dm_env)? Should probably evaluate this as a potential option.

Customizability - Environments