Ziyang Huang comments

Results 6 comments of


                                            Ziyang Huang

Can not run 13B inference model. After loading the ckpt, it just stoped and the gpus are still occupied.

> Sounds like it is indeed running the inference model. It will hang there till it actually completes its inference. If the GPU's are occupied that sounds like its doing...

llama7B issue

> In my training attempt (full model fine tuning with llama on HH datasets), in SFT stage, the training loss is not going down. In DPO, all outputs are NAN....

Can you provide a search_r1 example for agent-linghting v2?

FYI: I run `python search_r1_agent.py`, then `bash train.sh`. It hang like this:

Can you provide a search_r1 example for agent-linghting v2?

It is just like this, no dialogue. I have not read the code, so i do not know what happended😂

Can you provide a search_r1 example for agent-linghting v2?

@SiyunZhao Hi, I use agl v.0.1.2 to reproduce the search_r1. I use the env set in scripts/setup_stable_gpu.sh. But it kept throw errors like: Do you have any ideas on this...

How to run agentic rl with agent lightning in the distributed training environment with multi-nodes and multi-gpus ?

Do you run successfully with multi nodes? @wendongbi