Yun Qu
Results
1
issues of
Yun Qu
### What does this PR do? > An implementation of MoPPS (Model Predictive Prompt Selection). A Bayesian framework for online predicting prompt difficulty to accelerate RL finetuning of Large Reasoning...