Xingyuan Zhang

Results 4 comments of Xingyuan Zhang

Hi guys. I just go through the code, and I find the setup of NumPy seed is missing in the `learn_model.py` script which will cause the results to be unreproducible.

I agree with your point. In real-world scenarios, the reward function and the terminal function are available in the most cases (MDP settings and some POMDP settings). I guess future...

LGTM. cc @mzktbyjc2016 for merging.

Hi, I just cross check the CQL scores reported in D4RL (arXiv-v4) and CQL (NeurIPS) papers, there are few mismatches. |Task|D4RL (arXiv-v4)|CQL (NeurIPS)| |------|--------|--------------| |walker2d-medium|79.2|74.5| |hopper-medium|58.0|86.6| |walker2d-medium-replay|26.7|32.6| I hope you...