[RFC] Integrate VeOmni as the training engine in VERL

Open A1waysBeenHere opened this issue 2 months ago • 1 comments

Feature request

This RFC proposes integrating VeOmni as a training engine backend for VERL.
The goal is to leverage VeOmni’s high-performance distributed training framework to enhance VERL’s scalability and efficiency in large-scale RLHF and post-training workflows.

Motivation

  • Support FSDP2+EP, enabling VERL to run large MoE models easily on FSDP2 without relying on Megatron.
  • Introduce GroupGemm ops for MoE and integrate Liger-Kernel for higher performance.
  • Scale any omni-model easily.

Your contribution

I am going to submit a PR: #4072

A1waysBeenHere, Oct 27 '25 02:10

cc @vermouth1992 @wuxibin89

ji-huazhong, Oct 27 '25 02:10