llama.cpp
LLM inference in C/C++
failed to tokenize string! `system_info: n_threads = 16 / 16 | AVX = 1 | AVX2 = 1 | AVX512 = 0 | FMA = 1 | NEON = 0...`
This change modifies the `quantize.sh` script so that it runs correctly on different platforms (including Windows under WSL).
bugfix: `std::string` messes up the vocab. OS: CentOS 7; compiler: gcc (GCC) 11.2.1 20220127 (Red Hat 11.2.1-9)
Hi everyone, I took a stab at adding embedding mode, where we print the sentence embedding for the input instead of generating more tokens. If I only add the compute...
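For context, a minimal sketch of what consuming such a mode might look like, assuming the PR exposes the result through accessors along the lines of `llama_get_embeddings` and `llama_n_embd` (the names and signatures here are assumptions about the API, not the PR's final code, and the snippet would link against llama.cpp):

```cpp
// Hypothetical sketch: print the sentence embedding after evaluating the
// prompt. The accessors below are assumed; the API added by the PR may differ.
#include <cstdio>

extern "C" {
    struct llama_context;
    float * llama_get_embeddings(struct llama_context * ctx); // assumed accessor
    int     llama_n_embd(struct llama_context * ctx);         // assumed accessor
}

void print_embedding(struct llama_context * ctx) {
    const int     n_embd = llama_n_embd(ctx);
    const float * emb    = llama_get_embeddings(ctx); // valid after an eval call

    for (int i = 0; i < n_embd; i++) {
        printf("%f ", emb[i]);
    }
    printf("\n");
}
```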
This builds on my [other PR](https://github.com/ggerganov/llama.cpp/pull/267) to implement a very simple TCP mode. The new mode first loads the model and then listens for TCP connections on a port. When a...
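The networking half of such a mode is plain POSIX sockets; here is a self-contained sketch of the accept loop, where `serve_client` is a placeholder (it just echoes bytes back) standing in for the part that wires a connection up to the inference loop, and the port is an arbitrary example:

```cpp
// Minimal POSIX TCP accept loop: set up once, then handle one connection
// at a time, keeping the (already loaded) model resident across connections.
#include <netinet/in.h>
#include <sys/socket.h>
#include <unistd.h>
#include <cstdio>

// Placeholder handler: echoes input back. In the PR this would instead feed
// the received prompt to the model and stream generated tokens out.
void serve_client(int fd) {
    char    buf[512];
    ssize_t n;
    while ((n = read(fd, buf, sizeof(buf))) > 0) {
        write(fd, buf, n);
    }
}

int main() {
    int srv = socket(AF_INET, SOCK_STREAM, 0);
    int yes = 1;
    setsockopt(srv, SOL_SOCKET, SO_REUSEADDR, &yes, sizeof(yes));

    sockaddr_in addr{};
    addr.sin_family      = AF_INET;
    addr.sin_addr.s_addr = htonl(INADDR_ANY);
    addr.sin_port        = htons(8080); // example port

    if (bind(srv, (sockaddr *) &addr, sizeof(addr)) != 0 || listen(srv, 1) != 0) {
        perror("bind/listen");
        return 1;
    }

    for (;;) {
        int cli = accept(srv, nullptr, nullptr);
        if (cli < 0) continue;
        serve_client(cli);
        close(cli);
    }
}
```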
Add: https://github.com/gyunggyung/OpenMLLM Use: https://github.com/gyunggyung/KoAlpaca.cpp
Resolves https://github.com/ggerganov/llama.cpp/issues/240. WIP. This needs to be able to:
1. Configure custom model folders.
2. Adjust settings for running variants of the Alpaca model and make corresponding changes in the...
This is a prototype of computing perplexity over the prompt input. It does so by using `n_ctx - 1` tokens as the input to the model and computing the softmax...
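The arithmetic behind that is the exponentiated average negative log-likelihood: perplexity = exp(-(1/N) Σ log p(token_i | context_i)). A self-contained sketch of the softmax/perplexity step, operating on made-up logits rather than real model output:

```cpp
// Compute perplexity = exp(-(1/N) * sum_i log p(token_i | context_i)).
#include <algorithm>
#include <cmath>
#include <cstdio>
#include <vector>

// Log-probability of token `tok` under a softmax over `logits`,
// subtracting the max logit first for numerical stability.
double log_softmax_at(const std::vector<double> & logits, int tok) {
    double max = logits[0];
    for (double l : logits) max = std::max(max, l);
    double sum = 0.0;
    for (double l : logits) sum += std::exp(l - max);
    return (logits[tok] - max) - std::log(sum);
}

int main() {
    // One logit row per predicted position; tokens[i] is the observed token.
    std::vector<std::vector<double>> rows   = {{2.0, 0.5, -1.0}, {0.1, 1.5, 0.3}};
    std::vector<int>                 tokens = {0, 1};

    double nll = 0.0; // accumulated negative log-likelihood
    for (size_t i = 0; i < rows.size(); i++) {
        nll -= log_softmax_at(rows[i], tokens[i]);
    }
    printf("perplexity = %f\n", std::exp(nll / rows.size()));
}
```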
After running the command:

```
python3 convert-pth-to-ggml.py /Users/tanish.shah/llama.cpp/models/7B/ 1
```

Error with sentencepiece:

```
Traceback (most recent call last):
  File "/Users/tanish.shah/llama.cpp/convert-pth-to-ggml.py", line 75, in <module>
    tokenizer = sentencepiece.SentencePieceProcessor(fname_tokenizer)
                ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/tanish.shah/llama.cpp/env/lib/python3.11/site-packages/sentencepiece/__init__.py", line 447,...
```