llama-api-server
llama-api-server copied to clipboard
A OpenAI API compatible REST server for llama.
Bumps [black](https://github.com/psf/black) from 23.3.0 to 24.3.0. Release notes Sourced from black's releases. 24.3.0 Highlights This release is a milestone: it fixes Black's first CVE security vulnerability. If you run Black...
Bumps [idna](https://github.com/kjd/idna) from 3.4 to 3.7. Release notes Sourced from idna's releases. v3.7 What's Changed Fix issue where specially crafted inputs to encode() could take exceptionally long amount of time...
Bumps [hiq-python](https://github.com/oracle/hiq) from 1.1.11 to 1.1.12. Commits See full diff in compare view [](https://docs.github.com/en/github/managing-security-vulnerabilities/about-dependabot-security-updates#about-compatibility-scores) You can trigger a rebase of this PR by commenting `@dependabot rebase`. [//]: #...
Bumps [markdown-it-py](https://github.com/executablebooks/markdown-it-py) from 2.2.0 to 3.0.0. Release notes Sourced from markdown-it-py's releases. v3.0.0 Release ⚠️ This release contains some minor breaking changes in the internal API and improvements to the...
The version of llama-cpp-python this project uses is quite old. Therefore I get a lot of errors regarding versions of GGML models. It also doesn't support GGUF models. I would...
Hi. has anyone come up with a unified, standard API to talk to all LLMs? (LLaMA, Bard, OpenAI etc)? if not, it would be helpful to start defining that as...
Bumps [zipp](https://github.com/jaraco/zipp) from 3.15.0 to 3.19.1. Changelog Sourced from zipp's changelog. v3.19.1 Bugfixes Improved handling of malformed zip files. (#119) v3.19.0 Features Implement is_symlink. (#117) v3.18.2 No significant changes. v3.18.1...
Bumps [certifi](https://github.com/certifi/python-certifi) from 2023.7.22 to 2024.7.4. Commits bd81538 2024.07.04 (#295) 06a2cbf Bump peter-evans/create-pull-request from 6.0.5 to 6.1.0 (#294) 13bba02 Bump actions/checkout from 4.1.6 to 4.1.7 (#293) e8abcd0 Bump pypa/gh-action-pypi-publish from...
Bumps [transformers](https://github.com/huggingface/transformers) from 4.32.1 to 4.42.3. Release notes Sourced from transformers's releases. Patch release v4.42.3 Make sure we have attention softcapping for "eager" GEMMA2 model After experimenting, we noticed that...
Bumps [hiq-python](https://github.com/oracle/hiq) from 1.1.11 to 1.1.13. Commits See full diff in compare view [](https://docs.github.com/en/github/managing-security-vulnerabilities/about-dependabot-security-updates#about-compatibility-scores) Dependabot will resolve any conflicts with this PR as long as you don't alter...