D4RL
A collection of reference environments for offline reinforcement learning
Hi, I notice there are differences between results reported in CQL paper and D4RL paper for this benchmark. Since some of the authors are common for both papers, can you...
Currently the [reproducibility guide](https://github.com/rail-berkeley/d4rl/wiki/Dataset-Reproducibility-Guide) in the MuJoCo section doesn't say which version of the environments was used (`Hopper-v2` vs `Hopper-v3`), nor is this present anywhere else in the...
What `mujoco` version was used for collecting the datasets? I observed that the latest gym version discontinued support for `mujoco>=2.0`, whereas the `mujoco_py` README shows instructions for installing `mujoco 2.0`....
Hi, as stated in the README, the "infos" contained in each dataset are task-specific debugging information, but what are they exactly for the kitchen environment? And where can I find descriptions...
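One way to see what is actually stored, regardless of documentation, is to list the `infos/*` keys once the dataset is loaded. A minimal sketch, assuming the dataset loads as a flat dict whose debugging fields live under keys prefixed with `infos/` (the keys in the toy dict below are illustrative, not the kitchen environment's actual fields):

```python
# Sketch: listing the task-specific "infos" fields in a D4RL-style dataset.
# Assumes the dataset is a flat dict; the toy keys below are hypothetical.

def info_keys(dataset):
    """Return the sorted names of all infos/* fields in a flat dataset dict."""
    return sorted(k for k in dataset if k.startswith("infos/"))

# Toy stand-in for env.get_dataset() output:
sample = {
    "observations": [[0.0]],
    "actions": [[0.0]],
    "infos/goal": [[1.0]],
    "infos/qpos": [[0.5]],
}
print(info_keys(sample))  # ['infos/goal', 'infos/qpos']
```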
Hello, Thank you for making this available. However, holding MuJoCo as a dependency restricts open and free research. MuJoCo has a heavy license fee and private student licenses can't be...
There seems to be a single terminal = true flag in each of the half cheetah datasets. Do you know why this is the case? The half cheetah gym environment...
First, thank you for sharing the repo! The dataset seems to consist of state-action pairs; is there a way to recover the entire rollout of a policy?
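If the flat transition arrays carry episode-end flags, rollouts can be reconstructed by splitting at those boundaries. A minimal sketch, assuming the dataset is a dict of aligned arrays with `terminals` and `timeouts` flags marking episode ends (these names follow the common D4RL layout; verify them against your dataset's actual keys):

```python
# Sketch: recovering per-episode rollouts from a flat D4RL-style dataset.
# Assumes aligned arrays and "terminals"/"timeouts" episode-end flags.

def split_into_episodes(dataset):
    """Split flat transition arrays into a list of per-episode dicts."""
    n = len(dataset["observations"])
    timeouts = dataset.get("timeouts", [False] * n)
    episodes, start = [], 0
    for i in range(n):
        if dataset["terminals"][i] or timeouts[i] or i == n - 1:
            episodes.append({k: v[start:i + 1] for k, v in dataset.items()})
            start = i + 1
    return episodes

# Toy example: three transitions forming two episodes.
toy = {
    "observations": [[0.0], [0.1], [0.2]],
    "actions": [[1.0], [1.0], [1.0]],
    "terminals": [False, True, False],
    "timeouts": [False, False, False],
}
print(len(split_into_episodes(toy)))  # 2
```

Note that this recovers trajectory segments as stored; if the data collection truncated episodes without setting a flag, the original rollouts cannot be recovered exactly.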
Hi, I could not find the propensities of the logging policy in the dataset. Can they be made available, since importance weighted methods would benefit from that knowledge? Thanks!
Hi, I would like to use d4rl dataset for DICE scenarios, where sampling from initial states is required. I thought the termination flag could be helpful at first glance, but...
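If the termination and timeout flags do mark episode ends, the initial-state distribution can be approximated by taking the first observation of each episode. A minimal sketch under that assumption (the flag names follow the common D4RL layout; truncated trajectories without flags would be missed):

```python
# Sketch: indices of observations that begin a new episode, for sampling
# initial states in DICE-style methods. Assumes "terminals"/"timeouts"
# flags mark episode ends.

def initial_state_indices(terminals, timeouts):
    """Index 0 plus every index immediately following an episode end."""
    starts = [0]
    for i in range(len(terminals) - 1):
        if terminals[i] or timeouts[i]:
            starts.append(i + 1)
    return starts

terminals = [False, False, True, False, False]
timeouts = [False, False, False, False, True]
print(initial_state_indices(terminals, timeouts))  # [0, 3]
```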
RLkit has changed a lot; for example, FlattenMlp no longer exists. Any recommendation on which RLkit version is compatible with your repository? Thank you in advance. :) Best