llama.cpp
llama.cpp copied to clipboard
LLM inference in C/C++
### Prerequisites - [X] I am running the latest code. Mention the version if possible as well. - [X] I carefully followed the [README.md](https://github.com/ggerganov/llama.cpp/blob/master/README.md). - [X] I searched using keywords...
This one looks promising.. https://huggingface.co/omkarthawakar/LlamaV-o1
### Prerequisites - [X] I am running the latest code. Mention the version if possible as well. - [X] I carefully followed the [README.md](https://github.com/ggerganov/llama.cpp/blob/master/README.md). - [X] I searched using keywords...
This is intended to be a discussion on adding MCP SSE Client support to the current WebUI. I have not started migrating the demo included into the WebUI, but the...
### Git commit ecebbd292d741ac084cf248146b2cfb17002aa1d ### Operating systems Linux, Other? (Please let us know in description) ### GGML backends CUDA ### Problem description & steps to reproduce I'm trying to compile...
I've had these changes locally for quite a while and more or less forgotten about them, it's high time to make a PR as it fixes a few issues (endianess...
this commit fixes this error: ```python Traceback (most recent call last): File "/home/poweruser/python-goddamn-venv/bin/gguf-dump", line 8, in sys.exit(gguf_dump_entrypoint()) ^^^^^^^^^^^^^^^^^^^^^^ File "/home/poweruser/python-goddamn-venv/lib/python3.12/site-packages/gguf/scripts/gguf_dump.py", line 450, in main dump_metadata(reader, args) File "/home/poweruser/python-goddamn-venv/lib/python3.12/site-packages/gguf/scripts/gguf_dump.py", line 35,...
Refactoring of chat / tool-call logic (follow up to https://github.com/ggml-org/llama.cpp/pull/11016) along the lines of @ggerganov's suggestions ([ref](https://github.com/ggml-org/llama.cpp/pull/11016#issuecomment-2660837676)): - Move all `common_chat_*` from `common.*` to `common/chat.*` - Moved minja headers to...
Sync with latest upstream https://github.com/yhirose/cpp-httplib Should fix #11335
JSON files must be UTF-8 encoded, see [8.1 in RFC 8259](https://www.rfc-editor.org/rfc/rfc8259#section-8.1).