langchain-benchmarks icon indicating copy to clipboard operation
langchain-benchmarks copied to clipboard

[Feature req] Agents: comparing fine-tuning techniques

Open hinthornw opened this issue 1 year ago • 0 comments

Common question: I'm fine-tuning for an agent. What split of data should I prioritize collecting, and in what mixture?

  • Fewer long trajectories?
  • More short trajectories / single-step function calls?
  • If there is conversation in the mix, how much should I include? And then do i include the full trajectory flattened out? or remove for later calls?

hinthornw avatar Dec 19 '23 22:12 hinthornw