Brian
do we use valgrind here and should we?
@stevegrubb looks like it might take a while for https://github.com/ggerganov/llama.cpp/pull/6289 to be merged in. You may want to check if any of the leak fixes from there are directly related...
Is this bug still present? Just chasing up older PRs to make sure it's not obsolete.
@zhouwg attempted to resolve the conflict, but you may want to consider rebasing anyway.
@zhouwg you mean writing directly to master? Well, no, but I've been helping out, at least with triaging, which should still be helpful. If you've noticed, I've been putting labels everywhere...
Don't forget https://github.com/ggerganov/llama.cpp/wiki as well; I've reorganised the sidebar to be a bit clearer. The wiki will be useful for information that is a bit too technical for Wikipedia.
```
$ ./build/bin/tokenizer-verifier ./models/ggml-vocab-aquila.gguf
WARNING: Behavior may be unexpected when allocating 0 bytes for ggml_calloc!
llama_model_loader: loaded meta data with 18 key-value pairs and 0 tensors from ./models/ggml-vocab-aquila.gguf (version GGUF...
```
> If it helps, we follow a somewhat similar (but not exhaustive) approach in the `gguf-my-repo` quantisation space: https://huggingface.co/spaces/ggml-org/gguf-my-repo/blob/main/app.py#L67
>
> Standardisation in file naming is always a great move! Had a...
@julien-c do you have a preference when it comes to parsing filenames? I'm basically treating it as a sort of dash-separated (`-`) value (in which case, I should probably...
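For illustration, here's roughly what I mean by treating the filename as a dash-separated value; a minimal Python sketch, assuming the quant encoding sits at the end of the stem (the function name, field names, and prefix heuristic here are hypothetical, not a settled convention):

```python
# Minimal sketch (not a final parser) of splitting a GGUF filename on '-'.
# Field names and the quant-prefix heuristic are illustrative assumptions.

def parse_gguf_filename(filename: str) -> dict:
    stem = filename.removesuffix(".gguf")
    fields = stem.split("-")
    parsed = {"model_name": None, "encoding": None}
    # Quant encodings like Q4_K_M, IQ2_XS or F16 usually sit at the end.
    if fields and fields[-1].upper().startswith(("Q", "IQ", "F", "BF")):
        parsed["encoding"] = fields.pop()
    parsed["model_name"] = "-".join(fields)
    return parsed

print(parse_gguf_filename("TinyLlama-1.1B-Chat-v1.0-Q4_K_M.gguf"))
# -> {'model_name': 'TinyLlama-1.1B-Chat-v1.0', 'encoding': 'Q4_K_M'}
```

The obvious wrinkle is that model names themselves contain dashes, so fields can only reliably be peeled off from the ends of the stem rather than split positionally.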
Thanks for the historical context. I might have gotten a bit crazy here, but I've ended up mapping each enum name to the tensor type description and the historical context...
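Roughly, the mapping has this shape; a minimal sketch with illustrative entries rather than the actual table from the PR:

```python
# Sketch of mapping ggml quantisation enum names to a human-readable
# description plus a bit of historical context. Entries are illustrative,
# not the real table.
GGML_QUANT_DESCRIPTIONS = {
    "F32":  "32-bit IEEE 754 float; the original full-precision format",
    "F16":  "16-bit IEEE 754 half-precision float",
    "Q4_0": "4-bit round-to-nearest quantisation; blocks of 32 weights share one scale",
    "Q8_0": "8-bit round-to-nearest quantisation; blocks of 32 weights share one scale",
}

def describe(enum_name: str) -> str:
    return GGML_QUANT_DESCRIPTIONS.get(enum_name, "unknown quantisation type")

print(describe("Q4_0"))
```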