stable-baselines3
stable-baselines3 copied to clipboard
use lerp for polyak updates
trafficstars
slightly increases numerical precision and speed
ideally, we'd put this into a torch.compile-d block, but it's not clear whether SB3 wants to support/use compile.