genrl issues

CUDA support Agents

9

sampreet-arthi

bug

good first issue

Priority:High

Algorithms

AttributeError: module 'tensorboard.lazy' has no attribute 'lazy_load'

1

I'm trying to execute this simple code ``` import gym from genrl.agents import QLearning from genrl.trainers import ClassicalTrainer env = gym.make("FrozenLake-v0") agent = QLearning(env) trainer = ClassicalTrainer(agent, env, mode="dyna", model="tabular",...

nbro

MCTS

2

Progress - - [X] Added modular structure for Tree search agents and tree search planners - [X] UCT Node - [ ] OPD - [ ] OLOP - [ ]...

hades-rp2010

RPC Communication in Distributed RL Training

2

There's three ways that I can think of having distributed training: 1. Use of Pytorch's Distributed Training infrastructure. Would require establishing communication protocols specific to the case of Deep RL....

Sharad24

enhancement

Core

no-issue-activity

c++

Logger Formatting

12

The current logger might go on to the next line if there are a lot of key, value pairs. There could be three solutions to this: 1. Put a limit...

Sharad24

enhancement

no-issue-activity

Distributional Agents

1

Agents should be structured in a way that they can be extended to distributional or distributed agents (and both as well, case in point: D4PG and lots of others :))....

Sharad24

no-issue-activity

Usage explanatory docs

44

Go to the `docs/source/usage/tutorials` and add separate `.md` files to explain the following: - [x] Using A2C (@Darshan-ko ) - [ ] Using PPO1 - [x] Using VPG (@Devanshu24 )...

sampreet-arthi

documentation

good first issue

Examples

no-issue-activity

save_to_gif argument in Trainer

3

Save a GIF file based on this argument in trainer. To-do: 1. Check tensorboard saving in video

Sharad24

good first issue

no-issue-activity

Custom Loss Functions for RL

3

We should think about common loss functions that are used a lot in RL that can be packaged. As of now, we're constructing everything from scratch so we're going towards...

Sharad24

help wanted

Priority:Low

Core

no-issue-activity

Mujoco, PyBullet support

12

We should develop an environment module with wrappers. For a starter, I find [TF Agents env module](https://github.com/tensorflow/agents/tree/master/tf_agents/environments) pretty good.

Sharad24

enhancement

help wanted

v0.1

no-issue-activity

genrl
genrl copied to clipboard

Metadata

CUDA support Agents

AttributeError: module 'tensorboard.lazy' has no attribute 'lazy_load'

MCTS

RPC Communication in Distributed RL Training

Logger Formatting

Distributional Agents

Usage explanatory docs

save_to_gif argument in Trainer

Custom Loss Functions for RL

Mujoco, PyBullet support

← Metadata

Owner

Metadata

genrl genrl copied to clipboard

Metadata

← Metadata

Owner

Metadata

genrl
genrl copied to clipboard