llama.cpp
LLM inference in C/C++
llama.cpp seems to give bad results compared to Facebook's implementation. Here's an example of a simple reading-comprehension prompt: > Question: "Tom, Mark, and Paul bought books: two with pictures and one...
We should probably make a logo for this project. Like an image of a 🦙 and some C++
When I run the two commands, the installer throws the following errors about halfway through the install: cc -I. -O3 -DNDEBUG -std=c11 -fPIC -pthread -DGGML_USE_ACCELERATE -c ggml.c -o ggml.o ggml.c:1364:25:...
Is there any setting in any of the scripts to change the context limit? :) Thanks in advance!
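For what it's worth, in recent builds the context size is a command-line flag on the main binary rather than a script setting; a sketch of the invocation (the flag name `--ctx_size` and the model path are assumptions from memory and may differ by version):

```shell
# Hypothetical invocation: raise the prompt-context window from the
# default (512 at the time of writing) to 2048 tokens.
./main -m ./models/7B/ggml-model-q4_0.bin --ctx_size 2048 -n 128 -p "Hello"
# Short form of the same flag: -c 2048
```

Note that models trained with a fixed context length may degrade if you run them past it, regardless of what the flag allows.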
Hello, I wanted to experiment with installing the system in a Linux/Debian container, but I get the following error when I issue make: - "failed in call to 'always_inline' '_mm256_cvtph_ps'"...
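This "always_inline" failure on `_mm256_cvtph_ps` usually means the F16C intrinsics are being compiled without the matching instruction-set flag. A possible workaround, assuming GCC or Clang on an x86-64 machine whose CPU actually supports F16C (the exact CFLAGS here are a sketch, not the project's canonical ones):

```shell
# Hypothetical workaround: enable F16C (and AVX, which it depends on) explicitly.
make clean
make CFLAGS="-I. -O3 -DNDEBUG -std=c11 -fPIC -pthread -mavx -mf16c"

# Or let the compiler pick up everything the build machine supports:
# make CFLAGS="... -march=native"
```

If the CPU genuinely lacks F16C, the flags will make the binary crash at runtime instead; in that case the AVX paths need to be disabled rather than force-enabled.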
Use cmake to create the VC++ project, and debug in VS2022. python convert-pth-to-ggml.py models/7B/ 1 done. quantize.exe .\models\7B\ggml-model-f16.bin .\models\7B\ggml-model-q4_0.bin 2 done. llama -m .\models\7B\ggml-model-q4_0.bin -t 8 -n 128 > main:...
Fix the CMake build on Linux to prevent it from failing with an error message. ``` /usr/bin/ld: libggml.a(ggml.c.o): in function `ggml_graph_compute': ggml.c:(.text+0x16960): undefined reference to `pthread_create' /usr/bin/ld: ggml.c:(.text+0x169c3): undefined reference...
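The undefined `pthread_create` references at link time suggest the CMake build is not passing `-pthread`. One plausible fix (a sketch; the target name `ggml` is an assumption and must match whatever the actual CMakeLists.txt defines) is to link the standard CMake Threads package instead of relying on an implicit `-lpthread`:

```cmake
# Prefer the -pthread compiler flag over bare -lpthread where available.
set(THREADS_PREFER_PTHREAD_FLAG ON)
find_package(Threads REQUIRED)

# Link the imported target so both compile and link flags propagate.
# "ggml" is a placeholder; use the real library target from CMakeLists.txt.
target_link_libraries(ggml PUBLIC Threads::Threads)
```

Using the imported `Threads::Threads` target is generally more portable than hardcoding `-lpthread`, since CMake resolves the right flag per platform and compiler.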
We have drafted several changes to CMakeLists.txt that are expected to improve performance, compatibility, and maintainability. Change list: 1. remove _NO_ from...