imitation
Clean PyTorch implementations of imitation and reward learning algorithms
## Description Support for stable-baselines3 style callbacks in adversarial training. This feature was partly addressed in #626, but that PR appears to have been abandoned. ## Testing Tests in...
## Description This PR updates the adversarial algorithm so that the discriminator is trained between collecting the generator's rollouts and training the generator. This matches the reference implementation provided in...
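The reordering described above can be sketched as a minimal training round (a sketch with stub callables to illustrate the sequencing; the function names are hypothetical, not imitation's actual API):

```python
def adversarial_training_round(collect_rollouts, train_discriminator, train_generator):
    """One round of adversarial imitation training, with the discriminator
    updated between rollout collection and the generator update."""
    rollouts = collect_rollouts()    # 1. sample trajectories from the current generator
    train_discriminator(rollouts)    # 2. fit discriminator on fresh rollouts vs. expert data
    train_generator(rollouts)        # 3. update the generator against the refreshed discriminator

# Record the call order with stubs to show the sequencing this PR enforces.
order = []
adversarial_training_round(
    collect_rollouts=lambda: order.append("collect") or [],
    train_discriminator=lambda r: order.append("disc"),
    train_generator=lambda r: order.append("gen"),
)
print(order)  # ['collect', 'disc', 'gen']
```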
## Problem Robotic environments such as [SurRoL](https://github.com/med-air/SurRoL) and [Fetch](https://robotics.farama.org/envs/fetch/) use a Dictionary observation space with the keys 1. observation 2. desired_goal 3. achieved_goal. ## Query Is there a quick fix...
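As a stopgap for algorithms that expect flat arrays, dict observations with those three keys can be concatenated into a single vector (a NumPy sketch; the key names follow the issue, everything else is illustrative, and it is not imitation's built-in handling):

```python
import numpy as np

def flatten_dict_obs(obs, keys=("observation", "desired_goal", "achieved_goal")):
    """Concatenate the named entries of a goal-conditioned dict observation
    into one flat float array, in a fixed key order."""
    return np.concatenate([np.asarray(obs[k], dtype=np.float64).ravel() for k in keys])

# Example: a Fetch-style observation with the three standard keys.
obs = {
    "observation": np.zeros(10),
    "desired_goal": np.ones(3),
    "achieved_goal": np.ones(3),
}
flat = flatten_dict_obs(obs)
print(flat.shape)  # (16,)
```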
I am testing whether 'imitation' works with proprietary environments for the robot in our MuJoCo-based lab. For testing, I am generating Pygame environments based on Gym. I am creating user...
## Problem Due to [this validation](https://github.com/HumanCompatibleAI/imitation/blob/5c85ebf02a591dad171946710d80617cfcca108e/src/imitation/data/types.py#L131), environments that return integer rewards raise an exception, e.g. when I try to collect rollouts from an expert policy. This seems a bit overzealous....
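Until the validation is relaxed, one workaround is a thin wrapper that casts rewards to float before they reach the check (a self-contained sketch with a stub environment; the pattern mirrors Gym's `RewardWrapper`, but the class and env here are illustrative only):

```python
class FloatRewardWrapper:
    """Wrap an environment so step() always returns a float reward,
    avoiding validation errors on environments that emit integers."""
    def __init__(self, env):
        self.env = env

    def step(self, action):
        obs, reward, done, info = self.env.step(action)
        return obs, float(reward), done, info

# Stub environment returning an integer reward, for illustration only.
class IntRewardEnv:
    def step(self, action):
        return [0.0], 1, False, {}

env = FloatRewardWrapper(IntRewardEnv())
obs, reward, done, info = env.step(0)
print(type(reward).__name__)  # float
```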
## Problem Today only synthetic preferences are supported. It would be great to support real human preferences. ## Solution Requirements: - record videos of trajectories - ideally, extensible so we...
## Description See #711 ## Testing TODO: add notebook and experiment config that use this feature, and screenshots of behavior. (I've tested myself but not in a clean way.)
## Description Here are two scripts I used for checking for type errors in documentation and notebooks. I don't know whether this is of use to anyone, so I figured...
Right now we use Sacred to run experiments/algorithms. This PR is about exploring whether Hydra would be a good option for running experiments and constructing/configuring the CLI interface of `imitation`....
I wouldn't mind seeing something discussing whether these trajectory objects can only be used for imitation algorithms, or can also be used with stable-baselines3 or offline RL algorithms, and...
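For context on that question, a trajectory of observations, actions, and rewards can be unrolled into (s, a, r, s') transitions of the kind offline RL replay buffers consume (a generic sketch; the field layout is illustrative and is not imitation's actual `Trajectory` API):

```python
def trajectory_to_transitions(obs, acts, rews):
    """Turn a trajectory with len(obs) == len(acts) + 1 into a list of
    (state, action, reward, next_state) tuples for an offline RL buffer."""
    assert len(obs) == len(acts) + 1 == len(rews) + 1
    return [(obs[t], acts[t], rews[t], obs[t + 1]) for t in range(len(acts))]

# Example: a 3-step trajectory with 4 observations.
transitions = trajectory_to_transitions(
    obs=[0, 1, 2, 3], acts=["a", "b", "c"], rews=[1.0, 0.0, 1.0]
)
print(len(transitions))  # 3
```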