Can you provide a search_r1 example for agent-lightning v2?

Open hzy312 opened this issue 2 months ago • 6 comments

I can see that the search_r1 example is written for the v1 API: for example, it uses n_workers (n_runners in v2) and a URL training dataset. Can you explain the new features? Can you provide a search_r1 example for agl v2? Thanks.

hzy312 avatar Oct 27 '25 06:10 hzy312

@SiyunZhao do you have time for the migration?

ultmaster avatar Oct 27 '25 08:10 ultmaster

FYI: I ran python search_r1_agent.py, then bash train.sh. It hung like this:

Image

hzy312 avatar Oct 27 '25 08:10 hzy312

@SiyunZhao do you have time for the migration?

Yes, I’m available to do the migration.

SiyunZhao avatar Oct 27 '25 08:10 SiyunZhao

FYI: I ran python search_r1_agent.py, then bash train.sh. It hung like this:

Image

Seems like it’s hanging during validation. Do you have any logs printed from search_r1_agent.py showing the dialogue?

SiyunZhao avatar Oct 27 '25 08:10 SiyunZhao

It is just like this, with no dialogue. I have not read the code, so I do not know what happened 😂

Image

hzy312 avatar Oct 27 '25 09:10 hzy312

@SiyunZhao Hi, I used agl v0.1.2 to reproduce search_r1, with the environment set up by scripts/setup_stable_gpu.sh, but it kept throwing errors like:

Image

Do you have any ideas on this pydantic error for numpy.ndarray?

hzy312 avatar Oct 28 '25 07:10 hzy312