llm Workaround for litellm proxy -> ollama issue

Workaround for litellm proxy -> ollama issue

Open clevcode opened this issue 1 year ago • 4 comments

When streaming responses from a model served by ollama, through a litellm proxy that provides an OpenAI compatible API, the last chunk (i.e. choice["delta"]["content"]) of the streaming response will be set to None

This causes an exception to be thrown during the following, since None can't be appended to a string: content += choice["delta"]["content"]

To avoid this, I simply added a check so that choice["delta"]["content"] is only appended to content if it's not None.

Dec 17 '23 12:12 clevcode

I've researched the issue more now, and it's the very last chunk, containing the finish_reason field, that in the case of at least LiteLLM:s implementation of the OpenAI API, also sets the 'delta' field to {content: null, role: null} rather than just not setting the delta field at all in this case.

I think it's a good idea for this project to be resilient to that type of slight deviation from how the real OpenAI endpoints might respond, but I also think it should be fixed upstream in the litellm repo. I've already made some pull requests for a couple of other issues there when using litellm with ollama.

Dec 17 '23 18:12 clevcode

@simonw - any chance of getting this PR merged? I think it makes sense for llm to be defensive in this way. I'm currently vendoring my llm as I really need this change.

@clevcode - do you have plans to add a fix that eliminates the Nones to litellm directly? Lmk if I can help with this

Jan 15 '24 18:01 cmungall

Sorry for not getting this into 0.13 - this will need some tweaks now that I've upgraded the OpenAI Python library.

Jan 27 '24 00:01 simonw

Hey @clevcode i'm the maintainer of litellm

Tracking this issue on our end as well - https://github.com/BerriAI/litellm/issues/2010

Thanks for raising it.

Feb 16 '24 18:02 krrishdholakia

llm llm copied to clipboard

Workaround for litellm proxy -> ollama issue

llm
llm copied to clipboard