llama.cpp
LLM inference in C/C++
The goal of this refactor is to allow reusing the model execution while using streams other than stdin/stdout for interaction. In my case, I'd like to implement a simple TCP server...
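To make the idea concrete, here is a minimal sketch of that kind of server, assuming the refactor exposes the model loop as something that takes a pair of streams. The port, the echo stub, and the `run_inference_loop` name are all invented for illustration, not the proposed design:

```cpp
#include <cstdio>
#include <unistd.h>
#include <sys/socket.h>
#include <netinet/in.h>
#include <arpa/inet.h>

// Placeholder for the refactored model loop: here it just echoes lines back.
static void run_inference_loop(FILE * in, FILE * out) {
    char buf[512];
    while (fgets(buf, sizeof(buf), in)) {
        fprintf(out, "model> %s", buf);
        fflush(out);
    }
}

int main() {
    int srv = socket(AF_INET, SOCK_STREAM, 0);

    sockaddr_in addr = {};
    addr.sin_family      = AF_INET;
    addr.sin_addr.s_addr = htonl(INADDR_LOOPBACK);
    addr.sin_port        = htons(8080); // arbitrary example port

    bind(srv, (sockaddr *) &addr, sizeof(addr));
    listen(srv, 1);

    int client = accept(srv, NULL, NULL);

    // Wrap the socket in stdio streams so interaction code written against
    // fgets()/fprintf() could stay largely unchanged.
    FILE * in  = fdopen(client, "r");
    FILE * out = fdopen(dup(client), "w");

    run_inference_loop(in, out);

    fclose(in);
    fclose(out);
    close(srv);
    return 0;
}
```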
The following is a proposed template for creating new issues. If people think the tone could be improved, I'd appreciate feedback! ___ # Prerequisites Please answer the following questions for...
I was tinkering with the code and made the following change at line 977 of `main.cpp` (as it seemed wrong to me), *from*:
```C
if (embd.size() > params.n_batch) {
    break;
}
```
...
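The preview cuts off before the replacement code, so the actual change isn't shown here. As background on what that check guards, a hedged sketch of how pending tokens are typically consumed in `n_batch`-sized chunks rather than breaking out of the loop; `eval_tokens` is a hypothetical stand-in for the real evaluation call:

```cpp
#include <algorithm>
#include <vector>

// Hypothetical stand-in for the actual evaluation call in main.cpp.
static void eval_tokens(const int * tokens, int n_tokens, int n_past) {
    (void) tokens; (void) n_tokens; (void) n_past; // stub
}

// Feed all pending tokens to the model, at most n_batch at a time.
static void consume_pending(std::vector<int> & embd, int n_batch, int & n_past) {
    for (size_t i = 0; i < embd.size(); i += (size_t) n_batch) {
        int n = (int) std::min((size_t) n_batch, embd.size() - i);
        eval_tokens(embd.data() + i, n, n_past);
        n_past += n;
    }
    embd.clear();
}
```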
I am trying to output just the sentence embedding for a given input, instead of any new generated text. I think this should be rather straightforward but figured someone more...
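One common recipe, independent of llama.cpp's internals, is to mean-pool the per-token hidden states into a single sentence vector. A generic, self-contained sketch, assuming `token_embd` holds `n_tokens` rows of `n_embd` floats in row-major order:

```cpp
#include <vector>

// Average the per-token embeddings into one fixed-size sentence embedding.
static std::vector<float> mean_pool(const std::vector<float> & token_embd,
                                    int n_tokens, int n_embd) {
    std::vector<float> out(n_embd, 0.0f);
    for (int t = 0; t < n_tokens; ++t) {
        for (int i = 0; i < n_embd; ++i) {
            out[i] += token_embd[(size_t) t * n_embd + i];
        }
    }
    for (int i = 0; i < n_embd; ++i) {
        out[i] /= (float) n_tokens;
    }
    return out;
}
```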
This appears to solve https://github.com/ggerganov/llama.cpp/issues/153, where the error `ggml_new_tensor_impl: not enough space in the context's memory pool` is thrown in interactive mode if using a larger context size. At least...
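For background on the error: ggml allocates its memory pool once up front, so a pool sized for the default context overflows at larger `n_ctx`. A sketch of sizing logic that scales with the context length, with made-up constants rather than the values from this PR:

```cpp
#include <cstddef>

// Illustration of the failure mode, not the repository's actual fix:
// the pool must grow with n_ctx because the KV cache stores one key and
// one value vector of n_embd floats per layer per position.
static size_t estimate_pool_size(int n_ctx, int n_embd, int n_layer) {
    size_t kv_bytes = (size_t) n_ctx * n_layer * 2u * n_embd * sizeof(float);
    size_t overhead = 64u * 1024u * 1024u; // placeholder graph/scratch overhead
    return kv_bytes + overhead;
}
```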
I do not expect this to be merged, but I figured it might help others, though I don't know if this is the right place. This logs information to a...
### Discussed in https://github.com/ggerganov/llama.cpp/discussions/234 Originally posted by **ShouNichi** March 17, 2023 After running `git checkout 84d9015` and `make`, there is no output (only the model loading message) in Termux. `git...
In the PR that was resolved (#132), the action that publishes the packages used the user and token of the author of the commit on master. In this case,...
It would be great to start doing this kind of quantitative analysis of `ggml`-based inference: https://bellard.org/ts_server/ It looks like Fabrice evaluates the models using something called LM Evaluation Harness: https://github.com/EleutherAI/lm-evaluation-harness...
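Whatever harness is used, the headline number for this kind of analysis is usually perplexity, the exponential of the negative mean log-probability the model assigns to each ground-truth token. A toy sketch of the computation (the input values are made-up numbers):

```cpp
#include <cmath>
#include <cstdio>
#include <vector>

// ppl = exp(-(1/N) * sum_i log p(x_i | x_<i))
static double perplexity(const std::vector<double> & logprobs) {
    double sum = 0.0;
    for (double lp : logprobs) {
        sum += lp;
    }
    return std::exp(-sum / (double) logprobs.size());
}

int main() {
    std::vector<double> lp = { -2.3, -0.7, -1.1 }; // toy token log-probs
    std::printf("ppl = %.3f\n", perplexity(lp));
    return 0;
}
```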