CrossQ
CrossQ copied to clipboard
Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"
Hello, I am trying to benchmark your code on more tasks from deepmind/* but they are not working. There seems to be a bug in the `prepare_obs` function in `sbx/common/policies.py`....
Hello, When running the code on deepmind/pendulum-swingup the training crashes as the action becomes `nan`. I attach stack trace below (I added some more logging to catch exactly which part...
Hello, In your code you correctly specified: ``` os.environ['XLA_PYTHON_CLIENT_PREALLOCATE'] = 'false' ``` However currently this line of code is after `import jax` therefore is not working. I fixed this in...
Hello, Setting the flag: ``` os.environ['XLA_PYTHON_CLIENT_PREALLOCATE'] = 'false' ``` did not correctly impact JAX, as it was done after the import. Therefore JAX allocated ~80% of memory of the GPU....
My name is seungju kim, and I am a reinforcement learning researcher. I recently read your paper. I found your work to be incredibly insightful and highly relevant to my...