Gym
Gym copied to clipboard
single-step unsloth nemo gym notebook
Single step tutorial for GRPO with unsloth using nemo gym verifier!
Addresses https://github.com/NVIDIA-NeMo/Gym/issues/370
This pull request requires additional validation before any workflows can run on NVIDIA's runners.
Pull request vetters can view their responsibilities here.
Contributors can view more details about this message here.
Merged into Unsloth notebooks repo here instead https://github.com/unslothai/notebooks/blob/main/nb/nemo_gym_sudoku.ipynb