Practical_RL
A course in reinforcement learning in the wild
I used the following two commands to identify broken links (`markdown-link-check` is https://github.com/tcort/markdown-link-check):

```bash
find ./Practical_RL/ -type f -name '*.ipynb' -exec jupyter nbconvert --to markdown {} \;
find ./Practical_RL/...
```
The end of the notebook suggests evaluating the policy in a "theoretically better" way by sampling an initial action for each initial state uniformly and then playing with the current...
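A minimal sketch of what that evaluation could look like, assuming a tabular setting with a greedy policy. The names `env`, `get_action`, and `n_actions` are illustrative assumptions, not identifiers from the course notebooks:

```python
import numpy as np

def evaluate_uniform_first_action(env, get_action, n_actions,
                                  n_episodes=100, t_max=1000):
    """Sample the first action uniformly, then follow the current policy."""
    rewards = []
    for _ in range(n_episodes):
        s = env.reset()
        total = 0.0
        for t in range(t_max):
            if t == 0:
                a = np.random.randint(n_actions)  # uniform initial action
            else:
                a = get_action(s)                 # current (greedy) policy
            s, r, done, _ = env.step(a)
            total += r
            if done:
                break
        rewards.append(total)
    return np.mean(rewards)
```

Averaging over a uniformly sampled first action gives an estimate that does not depend on which action the greedy policy happens to prefer in the initial state.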
Example for `week6/a2c-optional`:

```python
actions = torch.tensor(trajectory['actions'], device=device)
action_log_probs = torch.gather(
    trajectory['log_probs'], dim=-1,
    index=actions.unsqueeze(-1)).squeeze(-1)
```

This _should_ work more efficiently than `to_one_hot`, although to be certain we could also benchmark it.
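A hedged micro-benchmark sketch for that comparison. The shapes and the one-hot variant are made up for illustration; they are not the notebook's actual `to_one_hot` code:

```python
import time
import torch

n, k = 100_000, 6  # assumed batch size and action count
log_probs = torch.randn(n, k)
actions = torch.randint(k, (n,))

def via_gather():
    # Select the log-prob of each taken action by index.
    return torch.gather(log_probs, dim=-1,
                        index=actions.unsqueeze(-1)).squeeze(-1)

def via_one_hot():
    # Same selection via an explicit one-hot mask and a sum.
    one_hot = torch.nn.functional.one_hot(actions, k).to(log_probs.dtype)
    return (log_probs * one_hot).sum(-1)

# Both variants must agree before timing them.
assert torch.allclose(via_gather(), via_one_hot())

for fn in (via_gather, via_one_hot):
    start = time.perf_counter()
    for _ in range(10):
        fn()
    print(fn.__name__, time.perf_counter() - start)
```

The gather variant avoids materializing an `n × k` mask, which is where the expected saving comes from.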
via @yhn112: Let's choose one of: 1) embed images to notebooks 2) host images in our repo 3) host images on some image hosting 4) find all the images on...
In [this file](https://github.com/yandexdataschool/Practical_RL/blob/master/week3_model_free/seminar_qlearning.ipynb) we have the `get_qvalue()` and `set_qvalue()` functions in the 2nd code block. This is not really a recommended way to handle getters and setters in Python...
There are two options for how this could be done:

* Clean up `spring19-pacman-visualization` and merge it into `spring19`
* Use one of the existing Gym implementations of Pacman:
  * https://github.com/sohamghosh121/PacmanGym
  * https://github.com/andreykurenkov/pacman-env
Currently there is a lot of garbage in metadata. Some notebooks refer to nonexistent kernels (like `rl`), others were created with Python 2 and throw an error message if you...
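A hedged sketch of how the offending metadata could be stripped with `nbformat`, so a notebook no longer points at a nonexistent kernel such as `rl`. The function name and usage are assumptions for illustration:

```python
import nbformat

def clean_kernel_metadata(path):
    """Remove the kernelspec entry from a notebook's metadata in place."""
    nb = nbformat.read(path, as_version=4)
    nb.metadata.pop('kernelspec', None)  # drop references to stale kernels
    nbformat.write(nb, path)
```

Run over the repo with something like `find . -name '*.ipynb'` piped into this function; Jupyter then falls back to the default kernel instead of erroring out.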
https://github.com/kefranabg/readme-md-generator produces fancy READMEs; maybe we could borrow some ideas from it.
It mentions Theano, which is probably not what we care about most in 2019.