CrossQ icon indicating copy to clipboard operation
CrossQ copied to clipboard

Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"

Results 5 CrossQ issues
Sort by recently updated
recently updated
newest added

Hello, I am trying to benchmark your code on more tasks from deepmind/* but they are not working. There seems to be a bug in the `prepare_obs` function in `sbx/common/policies.py`....

Hello, When running the code on deepmind/pendulum-swingup the training crashes as the action becomes `nan`. I attach stack trace below (I added some more logging to catch exactly which part...

Hello, In your code you correctly specified: ``` os.environ['XLA_PYTHON_CLIENT_PREALLOCATE'] = 'false' ``` However currently this line of code is after `import jax` therefore is not working. I fixed this in...

Hello, Setting the flag: ``` os.environ['XLA_PYTHON_CLIENT_PREALLOCATE'] = 'false' ``` did not correctly impact JAX, as it was done after the import. Therefore JAX allocated ~80% of memory of the GPU....

My name is seungju kim, and I am a reinforcement learning researcher. I recently read your paper. I found your work to be incredibly insightful and highly relevant to my...