PufferLib
PufferLib copied to clipboard
Fix Metta's Env Training
This PR fixes the issues arising because of improper integration of Metta's env. It properly integrates the env and achieves the convergence in training.