llama.cpp
LLM inference in C/C++
Firstly, thank you for the awesome project. I'm new to LLMs, so I hope this suggestion makes sense. LoRA is a technique used to reduce the number of parameters during...
> Not all of these checksums seem to be correct. Are they calculated with the "v2" new model format after the tokenizer change? PR: https://github.com/ggerganov/llama.cpp/pull/252 Issue: https://github.com/ggerganov/llama.cpp/issues/324 > > For...
So I am looking at https://github.com/antimatter15/alpaca.cpp and I see they are already running 30B Alpaca models, while we are struggling to run 7B due to the recent tokenizer updates. I...
We might want to add a Nix CI job to ensure it doesn't get desynced. @prusnak thoughts?
Otherwise the tests may be run and pass even if the build has errors, and the step itself can succeed despite a broken build.
Hey, I noticed the API is running on C++. Were the original weights in Python or C++? If in Python, I would think they were in PyTorch, since that is...
When I execute this command: `make -j && ./main -m ./models/7B/ggml-model-q4_0.bin -p "Building a website can be done in 10 simple steps:" -n 512` an error was reported: `llama_init_from_file: failed`...
Hello, I noticed something when trying the chat with Bob: I always get the first token as empty. 1 -> '' 4103 -> ' Trans' 924 -> 'cript'...
Hey! There should be a simple example of how to use the new C API (like one that simply takes a hardcoded string and runs llama on it until \n...
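A rough sketch of what such an example might look like, assuming the C API as declared in `llama.h` around the time of this issue (`llama_init_from_file`, `llama_tokenize`, `llama_eval`, `llama_sample_top_p_top_k`); the exact names, signatures, and sampling parameters should be checked against the current header, and the model path here is just a placeholder:

```c
#include <stdbool.h>
#include <stdio.h>
#include <string.h>
#include "llama.h"

int main(void) {
    // Sketch only: assumes the early llama.h API; verify signatures
    // against the header in the tree you are building.
    struct llama_context_params params = llama_context_default_params();
    struct llama_context * ctx =
        llama_init_from_file("models/7B/ggml-model-q4_0.bin", params);
    if (ctx == NULL) {
        fprintf(stderr, "failed to load model\n");
        return 1;
    }

    // Tokenize a hardcoded prompt (add_bos = true prepends the BOS token).
    const char * prompt = "Building a website can be done in 10 simple steps:";
    llama_token tokens[256];
    int n = llama_tokenize(ctx, prompt, tokens, 256, true);

    // Evaluate the prompt, then sample tokens until a newline appears.
    llama_eval(ctx, tokens, n, 0, /*n_threads=*/4);
    for (int n_past = n; n_past < 128; n_past++) {
        llama_token tok = llama_sample_top_p_top_k(
            ctx, NULL, 0, /*top_k=*/40, /*top_p=*/0.95f,
            /*temp=*/0.8f, /*repeat_penalty=*/1.1f);
        const char * piece = llama_token_to_str(ctx, tok);
        if (strchr(piece, '\n')) break;   // stop at the first newline
        printf("%s", piece);
        llama_eval(ctx, &tok, 1, n_past, 4);
    }

    llama_free(ctx);
    return 0;
}
```

Note this needs a built `libllama` and a converted model on disk to actually run.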
Previously, `python quantize.py --models-path .. 7B 13B` would fail to find `../7B/ggml-model-f16.bin`. Now, it computes the absolute path to the models and uses that instead, which works.
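The idea behind the fix can be sketched like this (the helper name and layout are illustrative, not the actual `quantize.py` internals):

```python
from pathlib import Path

def model_f16_path(models_path: str, model_name: str) -> Path:
    """Resolve the f16 model file under --models-path as an absolute path.

    A relative path like ``../7B/ggml-model-f16.bin`` breaks when the
    script is invoked from a different working directory; resolving it
    up front makes the lookup independent of the current directory.
    """
    return (Path(models_path) / model_name / "ggml-model-f16.bin").resolve()
```

For example, `model_f16_path("..", "7B")` yields an absolute path ending in `7B/ggml-model-f16.bin` regardless of where the script was launched from.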