SunJin Kim
Results
1
comments of
SunJin Kim
Has been implemented in https://github.com/agentica-project/verl-pipeline, which was used to make the the [DeepCoder-14B](https://pretty-radio-b75.notion.site/DeepCoder-A-Fully-Open-Source-14B-Coder-at-O3-mini-Level-1cf81902c14680b3bee5eb349a512a51) model. They saw 2.5x speedup in code RL training. 