xavier-owkin

3 issues by xavier-owkin

In Chapter 11 of the course, in the file **FineTuning with SFTTrainer** (`3.mdx`), you explain how to fine-tune a DeepSeek model with `SFTTrainer` on an instruction dataset. Why don't you...

When trying to train a LangGraph agent with GRPO, I observe the following warnings when the agent uses LangChain's `with_structured_output` function: `(TaskRunner pid=2743926) Warning: Reward is None`...

adapter
tracer

Hello, is it planned to release the `numpy