Georgi Gerganov
Georgi Gerganov
target #9707 Adapt the Metal backend to the new registry and device interfaces. - [x] I have read the [contributing guidelines](https://github.com/ggerganov/llama.cpp/blob/master/CONTRIBUTING.md) - Self-reported review complexity: - [ ] Low -...
I was just thinking about this idea, so writing it down for future research. We should be able to fairly easy generate model-specific Metal code that has hardcoded kernels for...
Following up on #2421, I think we should implement some better way to observe at which point of the inference the results start to deviate significantly between the classical and...
# Overview This is a list of changes to the public interface of the `llama` library. Collaborators are encouraged to edit this post in order to reflect important changes to...
## Overview This PR is an intermediate step towards a more generic implementation that will support different underlying implementations of `llama_kv_cache`, `llama_context` and the graph building logic (a.k.a. `llm_build_context`). The...
According to this https://github.com/ggerganov/llama.cpp/discussions/336#discussioncomment-11184134, there is a new CoreML API and an ANE backend might be possible to implement with latest Apple software/hardware.
Depends on: https://github.com/ggerganov/llama.cpp/issues/5214 The `llamax` library will wrap `llama` and expose common high-level functionality. The main goal is to ease the integration of `llama.cpp` into 3rd party projects. Ideally, most...
Currently, the [ggml](https://github.com/ggerganov/ggml) source tree is synchronized into the source tree of `whisper.cpp`: https://github.com/ggerganov/whisper.cpp/tree/master/ggml This is not ideal, because the `ggml` source code is duplicated and has to be synchronized...
It would be nice to start measuring the word error rate (WER) of `whisper.cpp` across some representative dataset: - short audio - long audio - english - non-english - etc....
The Emscripten build for the WASM examples hasn't been maintained for a very long time. There are likely regressions that have been introduced that would need to be fixed. Last...