torchrl
Ability to record and store trajectories
Storing trajectories is helpful for algorithms that learn off-policy. This may or may not be needed.
Need this for Backplay.
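A rough sketch of what recording and storing could look like, to make the request concrete. `TrajectoryRecorder` and its methods are hypothetical names, not an existing torchrl API, and the sketch assumes that states and actions are picklable and that an episode ends when `done` is true.

```python
import pickle


class TrajectoryRecorder:
    """Hypothetical helper (not part of torchrl): groups transitions by episode."""

    def __init__(self):
        self.trajectories = []  # completed episodes, each a list of transitions
        self._current = []      # transitions of the episode in progress

    def record(self, state, action, reward, next_state, done):
        """Append one transition; close the episode when done is true."""
        self._current.append({
            "state": state,
            "action": action,
            "reward": float(reward),
            "next_state": next_state,
            "done": bool(done),
        })
        if done:
            self.trajectories.append(self._current)
            self._current = []

    def save(self, path):
        """Write all completed episodes to disk (assumes picklable contents)."""
        with open(path, "wb") as f:
            pickle.dump(self.trajectories, f)

    @classmethod
    def load(cls, path):
        """Rebuild a recorder from a file written by save()."""
        recorder = cls()
        with open(path, "rb") as f:
            recorder.trajectories = pickle.load(f)
        return recorder
```

Keeping episodes as ordered lists, rather than a flat buffer of shuffled transitions, preserves the step order, which should matter for anything that needs to revisit a specific point in a stored trajectory.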