langchain-benchmarks
langchain-benchmarks copied to clipboard

Published 20 hours ago •

Reame
Issues

[Feature req] Agents: comparing fine-tuning techniques

Open hinthornw opened this issue 1 year ago • 0 comments

Common question: I'm fine-tuning for an agent. What split of data should I prioritize collecting, and in what mixture?

Fewer long trajectories?
More short trajectories / single-step function calls?
If there is conversation in the mix, how much should I include? And then do i include the full trajectory flattened out? or remove for later calls?

Dec 19 '23 22:12 hinthornw