PufferLib
PufferLib copied to clipboard
Fixed Spaces so it now converges
Fixed Spaces env: dtype=np.int8 -> dtype=np.float32, which was causing model to fail to learn. Reset dtype updated for consistency.