puffo
puffo
I frequently need to pull the latest version of models I've already downloaded. Taking inspiration the comments and suggestions in https://github.com/jmorganca/ollama/issues/1890, I've implemented a basic `sync` command to streamline this...
The [previous fix](https://github.com/BerriAI/litellm/pull/1495) was flawed as the /chat API is different from the /generate API. While fixing the regression, I noticed that there is inconsistent behaviour in Ollama where the...
I noticed some odd behaviour when working with ollama (via litellm as I have been trying to fix a bug in the integration over there). The `prompt_eval_count` parameter disappears from...