Guoqing Liu
Guoqing Liu
Thanks for such clear code! l have some questions on the dataset provided by https://drive.google.com/drive/folders/1h3H4AY_ZBx08hz-Ct0Nxxus-V1melu1U. Does this dataset contains multiple full episodes or subsample the episodes as openai/imitation did? Thanks!
hi, Thanks for code sharing! after reading the source code, l have such questions, could you help me better understand the code? 1. why design ac_noise? rather than deterministic action...
Dear authors, Thanks for sharing this high-quality code. l create expert policies by running scripts/scripts/run_rl_mj.py on CartPole-v0 successfully, but when l run the MountainCar-v0 with the same script, the score...
Dear authors, Thanks for sharing the code, and some checkpoints about this great work. l am wondering how you trained these checkpoint (i.e., `models/P2R/USPTO_50K_P2R.pt`), via 1) pretrain-finetune, or 2) train...
Dear authors, Thanks for releasing your code. Regarding the top-20, and top-50 results in the Readme file, can you tell how did you obtain this? 