xavier-owkin issues

Repositories
Issues
Comments

Results 3 issues of


                                            xavier-owkin

Use DataCollatorForCompletionOnlyLM in order to train LLM to follow instructions

In Chapter 11 of the course, in the file **FineTuning with SFTTrainer** (`3.mdx`), you explain how to fine-tune a DeepSeek model with `SFTTrainer` on an instruction dataset. Why don't you...

Empty triplets when using `with_structured_output` in a langchain agent

When trying to train a langgraph agent with GRPO, I observe the following warnings when the agent uses the `with_structured_output` function of langchain ``` (TaskRunner pid=2743926) Warning: Reward is None...

adapter

tracer

Support for numpy>2.0

Hello, is it planned to release the `numpy