openai-cookbook
Can a custom loss be supported in fine-tuning?
Hi there,
It seems that only classification and generation tasks can be fine-tuned with the fine-tunes API. Am I right? Would it be possible to support more fine-tuning tasks, such as the reward model in the paper "Learning to summarize from human feedback", which is trained with a pairwise loss?
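To make the request concrete, here is a minimal sketch of the pairwise loss as I understand it from that paper: for each comparison, the loss is -log(sigmoid(r_preferred - r_rejected)), where r is the scalar reward the model assigns to a summary. The function names below are my own and purely illustrative; nothing here is part of the fine-tunes API.

```python
import math


def pairwise_loss(reward_preferred: float, reward_rejected: float) -> float:
    """Return -log(sigmoid(reward_preferred - reward_rejected)).

    Hypothetical helper for illustration only. The loss is small when the
    human-preferred summary gets the higher reward, and large otherwise.
    """
    diff = reward_preferred - reward_rejected
    # -log(sigmoid(diff)) == log(1 + exp(-diff)); branch to avoid overflow.
    if diff >= 0:
        return math.log1p(math.exp(-diff))
    return -diff + math.log1p(math.exp(diff))


def mean_pairwise_loss(reward_pairs):
    """Average the loss over (reward_preferred, reward_rejected) pairs."""
    losses = [pairwise_loss(p, r) for p, r in reward_pairs]
    return sum(losses) / len(losses)


if __name__ == "__main__":
    # Made-up reward values, just to show the call shape.
    example_pairs = [(1.2, 0.3), (0.1, 0.4), (2.0, -1.0)]
    print(mean_pairwise_loss(example_pairs))
```

In a real setup the two rewards would come from the same model scoring two candidate summaries of the same post, and the gradient of this loss would flow back through both scores, which is why it does not fit a per-example classification or generation objective.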
I have the same problem.
That's right. We do not support that today.