OC comments

Results 26 comments of

OC

[DONOTMERGE][rollout] feat: ChatScheduler requests sglang fully async

> WIth this async implementation, I find that an error would immediately occur at the beginning of training (step 1) right after the rollout process. see verl/verl/trainer/ppo/ray_trainer.py @chenhaiq @SwordFaith >...

[DONOTMERGE][rollout] feat: ChatScheduler requests sglang fully async

> @chenhaiq I have tried to run the example via `bash examples/grpo_trainer/run_qwen2-7b_seq_balance.sh `, but I got an error as below: `Exception: sgl-kernel is installed with version 0.1.0, which is less...

[DONOTMERGE][rollout] feat: ChatScheduler requests sglang fully async

> `Exception: sgl-kernel is installed with version 0.1.0, which is less than the minimum required version 0.1.1. Please reinstall the latest version with 'pip install sgl-kernel --force-reinstall'`. Note that I...

[DONOTMERGE][rollout] feat: ChatScheduler requests sglang fully async

> Title: state_dict() hangs when running run_qwen2-7b_seq_balance.sh with tp=2 fixed in https://github.com/volcengine/verl/pull/2098

[DONOTMERGE][rollout] feat: ChatScheduler requests sglang fully async

> @chenhaiq When I run the following command: NGINE=sglang ROLLOUT_MODE=async bash tests/special_e2e/ppo_trainer/run_function_reward.sh the program hangs during execution. I am using 2 GPUs, and all other settings and configurations are exactly...

Async pipeline in generate and compute score

> We have already implemented this feature, please check `reward_model.launch_reward_fn_async=True` argument yes，my picture in first post is assume reward_model.launch_reward_fn_async=True. In current implementation, reward is a batch task async with old_log_prob...

when running the official gsm8k with tool, multi turn async rollout sglang example without any modifications, the model crashes and appears Nan.

> I pulled the latest version of verl's code and when running the official gsm8k with tool, multi turn async rollout sglang example without any modifications, the model crashes for...

OC

[DONOTMERGE][rollout] feat: ChatScheduler requests sglang fully async

[DONOTMERGE][rollout] feat: ChatScheduler requests sglang fully async

[DONOTMERGE][rollout] feat: ChatScheduler requests sglang fully async

[DONOTMERGE][rollout] feat: ChatScheduler requests sglang fully async

[DONOTMERGE][rollout] feat: ChatScheduler requests sglang fully async

Async pipeline in generate and compute score

when running the official gsm8k with tool, multi turn async rollout sglang example without any modifications, the model crashes and appears Nan.

[rollout] feat: add vllm pipeline parallel support for zmq executor

[rollout] feat: add vllm pipeline parallel support for zmq executor

[Help] Weird Error Messages in vLLM Async Rollout