Cal-QL issues

Online value function divergence for cql

1

Hi, thanks for your work! When I try CQL in the pen binary environment, I find that CQL's value function always tends to diverge (tried mixing ratio 0.0 and...

zhonghai1995

Hyperparameters for D4RL locomotion?

1

Hi, Thanks for providing your code. My question is regarding the results in Appendix D. What hyperparameter configuration is used for these tasks? Thanks.

trevormcinroe

Error when running scripts

Hi @nakamotoo, thank you for your amazing work. I am running your code and have finished installing the environment following the instructions in the README. When I run...

linhlpv

Question about the configuration dictionary for the default high/low reward values for each env

While inspecting your code, I found a function `cal_return_to_go` that requires a config dictionary of high/low reward values for each env. What is its purpose and...

HYDesmondLiu
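
For context on the question above: a "return-to-go" usually means the discounted sum of the rewards remaining in a trajectory. The sketch below is a generic illustration of that quantity only. It is not the repository's `cal_return_to_go`, and the name `discounted_return_to_go` and the `gamma` default are assumptions made for this example.

```python
import numpy as np

def discounted_return_to_go(rewards, gamma=0.99):
    """Hypothetical helper: G_t = sum over k >= t of gamma^(k-t) * r_k
    for a single finished episode."""
    returns = np.zeros(len(rewards), dtype=np.float64)
    running = 0.0
    # Walk the episode backwards so each step reuses the next step's return.
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        returns[t] = running
    return returns

# Example: three steps of reward 1.0 with gamma = 0.5
print(discounted_return_to_go([1.0, 1.0, 1.0], gamma=0.5))
```

How sparse-reward environments or per-environment reward bounds are handled is not covered here, since that depends on the repository's own configuration.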

[Question] Is MacOS supported?

Hi Mitsuhiko, I have a few questions about replicating the repo, as I am on macOS. I am not sure what to do with the "nvidia" or "cuda-nvcc". Add following environment variables...

zxp567
