garage issues

Results 108 garage issues

About details of MAML Pytorch

Hi, your code is nice work, but I am confused about some details of MAML PyTorch. In the inner loop you update the params of the tasks and save them in `all...

Zhikaiiii

Test running ci in containers

Test whether we can greatly simplify the CI by using the GitHub Actions container option

ziyiwu9494

Support Both GPU and CPU With The Ray Sampler

Was looking into using the Ray sampler with the GPU again this week because of its potential use with the evaluation samplers/meta evaluator. So, to recap, what makes it tricky...

avnishn

feature

PyTorch on CPU is slower than TF

See https://github.com/pytorch/pytorch/issues/975 for more info. PyTorch TRPO appears 50% slower than TF. Not sure about PPO, but I expect the wall-clock time gap will be the same. To fix this...

ryanjulian

pytorch

MAML performance on 2D navigation

Thank you for the clean and well-documented library! I am trying to use MAML for 2D navigation but have been achieving suboptimal policies. In particular, rollouts from the (adapted) trained...

kristian-georgiev

Inconsistent Usage between the Definition and Call of the Function _sample_params in cem.py

In the source code of [cem.py](https://github.com/rlworkgroup/garage/blob/master/src/garage/np/algos/cem.py), there is an inconsistency between the [definition](https://github.com/rlworkgroup/garage/blob/master/src/garage/np/algos/cem.py#L68) and the [call](https://github.com/rlworkgroup/garage/blob/master/src/garage/np/algos/cem.py#L166) of the function `_sample_params`. The details are presented below: In Line 166, [the call...

ghost

Ensure Python 3.8 support

ryanjulian

quality

tests

packaging

[WiP] Reproducible on- and off-policy sampling

Extend the `Environment` API to support setting environment-library-specific seeds. Tasks:

- [x] Extend `Environment` interface
- [x] Set seeds for Gym envs
- [x] Ensure seeds are set...

MkuuWaUjinga

update documentation on how to use rnns with tf/torch[pending]

the error a contributor got when using the `categoricalgrupolicy` with `TRPO` on the `tf` branch, computing backwards passes was ``` tensorflow.python.framework.errors_impl.InvalidArgumentError: Node 'optimize/hx_plain/gradients_hx_plain/ConjugateGradientOptimizer/update_opt_mean_kl/gradients_constraint/policy_1/gru/rnn_2/while_grad/policy_1/gru/rnn_2/while_grad_grad/ConjugateGradientOptimizer/update_opt_mean_kl/gradients_constraint/policy_1/gru/rnn_2/while_grad/policy_1/gru/rnn_2/while_grad_grad': Connecting to invalid output 78 of source...

avnishn

Rework logic for filling and checking replay buffer in torch sac, ddpg, and td3

Currently in SAC, `train_once` returns `None` if the replay buffer doesn't contain the minimum number of timesteps. This function should still return some value or raise an...

avnishn
