Yuge Zhang comments

Results 279 comments of


                                            Yuge Zhang

Can you provide a search_r1 example for agent-linghting v2?

@SiyunZhao do you have time for the migration?

Add agentops client(explorter) patch

Please resolve the conflicts

Add agentops client(explorter) patch

/ci

Add agentops client(explorter) patch

/ci

Add agentops client(explorter) patch

@acured Please avoid force push. This will toss away the diff and make review difficult. When merging, we will squash. So the chaotic commit history on the feature branch does...

Will trajectory modifications that bypass the LLM API be lost from the trajectory and thus unavailable for RL training?

> If the current tracer cannot detect such modifications, what is the officially recommended solution—is it to trigger a dummy LLM call, register an MCP tool, or something else? You...

about credit assignment

You can use emit_reward to generate intermediate reward signals. However, current verl algorithm only supports identical credit assignment. For that part of customization, please refer to #31.

rag train

Looks like a host memory boom. Would you try scripts/restart_ray.sh to restart ray?

How to train a multi-agent video understanding task

We can't train two models at the same time due to a verl limitation. We can however train two agents alternatively by specifying the `trained_agents` parameter in LitAgent.

Fix: Port conflict in tracer tests

/ci