seven8827
How can we obtain the **ground truth** of the target policy? How can we compute the **RMSE** as in Figures 1 to 5 of the paper https://arxiv.org/abs/1906.04733? As...
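For reference, RMSE is conventionally the square root of the mean squared difference between a set of estimates and a reference value. A minimal Python sketch, assuming the ground truth is a single scalar policy value and the estimates come from repeated runs; the function name `rmse` and the numbers are illustrative and not taken from the paper:

```python
import math

def rmse(estimates, ground_truth):
    """Root mean squared error of scalar estimates against one reference value."""
    if not estimates:
        raise ValueError("need at least one estimate")
    squared_errors = [(e - ground_truth) ** 2 for e in estimates]
    return math.sqrt(sum(squared_errors) / len(squared_errors))

# Hypothetical numbers, for illustration only.
estimates = [1.0, 3.0]
ground_truth = 2.0
error = rmse(estimates, ground_truth)
```

How the paper obtains its ground-truth value is not stated in this excerpt, so the sketch takes it as a given input.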
How can I delete a landmark during training? Thanks a lot.
When I run `python main.py`, I get the following error: Traceback (most recent call last): File "main.py", line 124, in main() File "main.py", line 76, in main predicted_state_std=predicted_state_std) File "/home/lq/Data-Efficient-Reinforcement-Learning-with-Probabilistic-Model-Predictive-Control/control_objects/gp_mpc_controller.py", line 806,...
Thanks for your work. I want to know how I can run this project. Thanks a lot!
### Question Hi, I have a question on the state of Blackjack-v1. In Blackjack-v1: 1. env.observation_space = Tuple(Discrete(32), Discrete(11), Discrete(2)) 2. However, I got a state `(**47**, 8, False)`. **The...
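The mismatch described above can be checked without Gym: `Discrete(n)` covers the integers 0 to n-1 by default, so a first component of 47 falls outside `Discrete(32)`. A plain-Python sketch; the helper `in_tuple_of_discrete` is hypothetical and only mimics the containment check, it is not part of the Gym API:

```python
def in_tuple_of_discrete(observation, sizes):
    """Return True if each component lies in {0, ..., n-1} for its size n."""
    if len(observation) != len(sizes):
        return False
    return all(0 <= int(x) < n for x, n in zip(observation, sizes))

sizes = (32, 11, 2)        # mirrors Tuple(Discrete(32), Discrete(11), Discrete(2))
reported = (47, 8, False)  # the state quoted in the question
inside = in_tuple_of_discrete(reported, sizes)
```

Here `inside` evaluates to `False`, which confirms the reported state is outside the declared space; it does not explain why the environment produced it.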
Hi, thanks for your great work. I can run the project successfully by following the README.md. How can I test this project and produce a video visualization of the game? Can...
Hi, when I run `python -m alpha_codium.solve_problem --dataset_name /workspace/xxx/codes/AlphaCodium/valid_and_test_processed --split_name test --problem_number 1`, it always shows the following problem: `2024-03-28 14:14:21.151 | INFO | alpha_codium.gen.coding_competitor:solve_problem:116 - problem_name: 1575_B. Building an Amusement Park...
Hi, I ran this project successfully. I need to replace the Allegro Hand model with another hand model. However, when I edit the code of `def compute_sim2real_observation(self, rand_floats):`...