Adil issues

Results: 11 issues of Adil

adding inference utility to do nemo2 ckpt loading without PTL and usi…

Inference util to go from a NeMo2 ckpt to MCore model loading for inference, using MCore utils.

feat: DTensorPolicyV2 GPT-OSS support

# What does this PR do ? Adds GPT-OSS SFT using AutoModel custom models + DeepEP. To run, launch the nightly container and run ``` NRL_FORCE_REBUILD_VENVS=true uv run examples/run_sft.py --config...

CI:L0

feat: Automodel init for DTensorPolicyV2

# What does this PR do ? Uses Automodel's FSDP2 manager for initializing the v2 worker. Sharding on current main: ``` 2025-11-12 16:03:44 (DTensorPolicyWorkerV2 pid=1247213) ================================================================================ 2025-11-12 16:03:44 (DTensorPolicyWorkerV2 pid=1247213)...

CI:L2

feat: fp16 for DTensor policies

# What does this PR do ? Adds fp16 for policy training: https://wandb.ai/nvidia/automodel-rl/workspace?nw=6pzs4djqn28 The wandb workspace linked here shows BF16 (v1 policy) and FP16 (v1 and v2 policies).

make `force_hf_flag` force vanilla Transformers

Currently the flag forces the user off custom models but still applies Automodel-specific performance optimizations such as the Liger kernel, a different attention backend, etc. We want to make the...

bug

feat: add loss comparison for nightly tests

Example run (here it just compares to itself, but the output would be JSONL from the test run):

```
pytest tests/functional_tests/gt_metrics/test_log_compare.py \
  --ground-truth-jsonl tests/functional_tests/gt_metrics/gpt_oss_20b_te_deepep_train_EP_8.jsonl \
  --compare-jsonl tests/functional_tests/gt_metrics/gpt_oss_20b_te_deepep_train_EP_8.jsonl
```
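To illustrate the kind of check a ground-truth vs. test-run JSONL comparison might perform, here is a minimal sketch. It is not the code in `test_log_compare.py`. The record fields `step` and `loss`, the tolerances, and the helper names `load_losses` and `compare_losses` are all assumptions made for illustration.

```python
import json
import math


def load_losses(path, step_key="step", loss_key="loss"):
    """Read a JSONL file into a {step: loss} dict.

    The key names are hypothetical; adjust them to the real log schema.
    """
    losses = {}
    with open(path) as f:
        for line in f:
            line = line.strip()
            if not line:
                continue  # tolerate blank lines
            record = json.loads(line)
            losses[record[step_key]] = float(record[loss_key])
    return losses


def compare_losses(ground_truth, candidate, rel_tol=1e-2, abs_tol=1e-6):
    """Return (step, expected, actual) for every ground-truth step that
    is missing from the candidate or differs beyond the tolerances."""
    mismatches = []
    for step, expected in sorted(ground_truth.items()):
        actual = candidate.get(step)
        if actual is None or not math.isclose(
            expected, actual, rel_tol=rel_tol, abs_tol=abs_tol
        ):
            mismatches.append((step, expected, actual))
    return mismatches
```

Comparing a file against itself, as in the example run, would yield an empty mismatch list under this sketch. A nightly test could then assert that the list is empty.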

add loss comparison for nightly CI

Auto dequantization for HF models

MCore Dataset needs to launch with uv instead of system python

Make StatefulDataloader be DP-aware