Adam Gleave comments

Results 172 comments of


                                            Adam Gleave

Traceback (most recent call last)

Sounds like `v4l2loopback` is either not installed or not loaded as a kernel module. I believe this should work: ``` sudo modprobe v4l2loopback devices=1 exclusive_caps=1 ``` You can put it...

Traceback (most recent call last)

Interesting, what's the output of `v4l2-ctl --list-devices`? Do you see "Dummy video device" listed?

Traceback (most recent call last)

OK, well `platform:v4l2loopback-000` is the right thing but it seems to be called `'GoproLinux'` on your system for some reason -- I suspect there's a config file somewhere setting this?...

Image-based IL example

I don't think we have any examples beyond those in the `examples/` folder, although we'd be happy to see a PR adding these if you wanted to port qxcv's notebook.

RewardNet refactor

Thanks for writing this up, Lev! # Modes We were a bit hesitant about relying on modes previously as it seemed less explicit than "call the right function" and so...

RewardNet refactor

> Let's say I want to get the unshaped reward from a reward network. At the moment I have to check the net to see if it is an instance...

RewardNet refactor

> The only other concrete use case I can think of at the moment would be reward ensembles. When doing training we might want to detect if we are using...

Support image-based observation spaces in same way as SB3

> IMO the cleanest fix to this would remove the two algorithm/policy fixes from SB3. Instead, the user would be responsible for wrapping environments in a `VecImageTranspose` wrapper if necessary....

Algorithm wishlist (meta-issue)

I've added DRLHP. In general I'd like to not focus just on learning from demonstrations, given this has some severe limitations in terms of reward ambiguity.

Algorithm wishlist (meta-issue)

We now have a DRLHP implementation thanks to https://github.com/HumanCompatibleAI/imitation/pull/320 Would welcome PRs on the others, too, but will be prioritizing making the existing algorithms solid and easy to use and...