imitation issues

GAIL tensorboard logging does not appear to work

## Bug description I'm following along with the GAIL example, and am trying to log to tensorboard. If I understand the API correctly, I should be able to log directly...

eufrizz

bug

TypeError: argument of type 'PosixPath' is not iterable

1

## Bug description Hi everyone, I am trying to play with the DAgger example from `docs/tutorials/2_train_dagger.ipynb` but get the following error when executing the line `dagger_trainer.train(2000)`: ```bash Exception has occurred:...

den-schmidt

bug

`transitions_collate_fn` expects `next_obs` from `TransitionsMinimal`

## Bug description https://github.com/HumanCompatibleAI/imitation/blob/a8b079c469bb145d1954814f22488adff944aa0d/src/imitation/data/types.py#L473 While `TransitionsMinimal` doesn't have the field `next_obs`, the collate function called during data loading expects there to be `next_obs`, making `TransitionsMinimal` unusable with e.g. `BC`. ##...

namheegordonkim

bug

"The observation_space and action_space were not given, can't verify new environments" when loading expert policy

2

## Bug description KeyError: The observation_space and action_space were not given, can't verify new environments. ## Steps to reproduce The code to reproduce is from Imitation's official documents. (https://imitation.readthedocs.io/en/latest/tutorials/1_train_bc.html) ```python...

boshi-an

bug

Fix PosixPath being passed to HuggingFace `save_to_disk` (which does not accept it)

1

## Description HuggingFace save_to_disk takes `PathLike` type which is defined as str, bytes or os.PathLike. imitation.util.parse_path always returned pathlib.Path which is not one of these types. This commit converts pathlib.Path...

iwishiwasaneagle

How to Retrieve mean/gen/train/loss during GAIL Training and Create Checkpoints on New Minimum

## Question Hello, I am trying to implement a callback during GAIL training that retrieves the mean/gen/train/loss value and creates a checkpoint whenever this loss reaches a new minimum. However,...

kechirojp

bug

Unable to load FeedForward32Policy -- multiple values for keyword argument 'net_arch'

1

## Bug description A `imitation.policies.base.FeedForward32Policy` that is saved using `policy.save()` cannot be loaded with `imitation.policies.base.FeedForward32Policy.load()`, raising the following error: ## Steps to reproduce Train a policy using `imitation.algorithms.bc.BC`, then save...

glolichen

bug

Fix inability to load FeedForward32Policy

The temporary/stupid fix to the bug described in #857.

glolichen

Robust termination Condition for equal horizon episodes

## Bug description Currently for some reason the termination condition that equalizes the horizon length for each rollouts is not work properly and thus generates variable horizon error. ## Steps...

abastola0

bug

Bug in assert statement in RewardNet that prevents using dict observations even if properly linearized by a feature extractor.

## Bug description In predict_th, there's an assert statement that says the following: assert rew_th.shape == state.shape[:1] This will fail, even if you've modified state_th using self.preprocess to be a...

WilliamDormer

bug

imitation
imitation copied to clipboard

Metadata

GAIL tensorboard logging does not appear to work

TypeError: argument of type 'PosixPath' is not iterable

`transitions_collate_fn` expects `next_obs` from `TransitionsMinimal`

"The observation_space and action_space were not given, can't verify new environments" when loading expert policy

Fix PosixPath being passed to HuggingFace `save_to_disk` (which does not accept it)

How to Retrieve mean/gen/train/loss during GAIL Training and Create Checkpoints on New Minimum

Unable to load FeedForward32Policy -- multiple values for keyword argument 'net_arch'

Fix inability to load FeedForward32Policy

Robust termination Condition for equal horizon episodes

Bug in assert statement in RewardNet that prevents using dict observations even if properly linearized by a feature extractor.

← Metadata

Owner

Metadata

imitation imitation copied to clipboard

Metadata

← Metadata

Owner

Metadata

imitation
imitation copied to clipboard