cmunley1 issues

Results: 11 issues of cmunley1

system oom with qwen 235b

**Describe the bug** CPU memory usage increases steadily until OOM with Qwen 235b a22b. The OOM occurs at the end of this chart. Customer-reported; we do not have a full reproducer yet, but the RL...

bug

high logprob error with qwen30b a3b gspo

**Describe the bug** Large logprob errors with qwen30b a3b with gspo ``` grpo: num_prompts_per_step: 256 num_generations_per_prompt: 16 loss_fn: reference_policy_kl_penalty: 0 ratio_clip_min: 3e-4 ratio_clip_max: 4e-4 ratio_clip_c: null use_on_policy_kl_approximation: false use_importance_sampling_correction: false...

bug

python flag for colab venv installation

Need to set the uv pip install Python flag in Colab environments when launching servers. Usage: `ng_run "+config_paths=[...]" +uv_pip_set_python=true `. Defaults to false. For https://github.com/NVIDIA-NeMo/Gym/issues/370. Needed for the notebook here: https://docs.unsloth.ai/models/nemotron-3#reinforcement-learning--nemo-gym

system oom with qwen 235b

high logprob error with qwen30b a3b gspo

python flag for colab venv installation

single-step unsloth nemo gym notebook

NeMo-Agent-Toolkit Integration

aime resources server

Salesforce xlam-function-calling-60k resources server

add unsloth and trl to docs

openai-compatible responses and chat completions endpoints in vllm_serve.py

arc-agi resource server