corax
corax copied to clipboard
Corax: Core RL in JAX
We currently only support the proto agent's dataset from ExoRL. We should add other agent's data as well.
Currently, we only have a good example for IQL with D4RL. We should consider refactoring existing or adding examples for other offline RL algorithms.
V-D4RL supports both distracting and multi-task datasets. We should add support for those as well. It is probably straightforward to reuse existing code for V-D4RL main.