sublayer icon indicating copy to clipboard operation
sublayer copied to clipboard

Multi-model providers break the universal function calling expectation inside providers

Open swerner opened this issue 9 months ago • 0 comments

For model providers like Groq, Local, OpenRouter, etc, each different model you might want to use inside these different services could have different function calling/tool calling mechanism and quirks: Specifically I just tried generating something with Groq+llama3 and the xml response came back in a different format.

For now, going to move away from providing direct solutions for things like Groq and make it similar to what we're doing with output_adapters where you can specify a custom AI provider in your code to use and implement the specific quirks of that model. Over time we'll find patterns that work with things like Llama3 or Mistral or Hermes2 that we can provide as bases.

swerner avatar Apr 29 '24 20:04 swerner