Yuge Zhang

Results 279 comments of Yuge Zhang

I think the problem is related to vllm. If you are using vllm >= 0.10.2, please verify whether return_token_ids + structured output (called guided decoding) works vllm's native chat completion...

It's possible that structured output didn't use the chat completion, thus won't be captured at client side. But since the proxy server side is universal, the raw_gen_ai_request always exist. In...

Yes. You can grep the example folder for examples.

I think it's related to #31

I think it's slightly different. In GRPO for RLHF, each trajectory = one generated response. In our setup, each trajectory = multiple responses.

Hi. I'm not familiar with Qwen Agent. My questions are: 1. Do they work with OpenAI's chat completion API? 2. Are they traceable via observability tools like AgentOps or Weave?

I think the answer is yes and no. We do plans to support agents written in other languages, like the undergoing work with Claude Code and Codex. Right now these...

We might want a Vercel AI SDK example after finalizing store RESTful API in v0.3.