[Proposal] "Stable" C API
I propose refactoring main.cpp into a library (llama.cpp, compiled to llama.so/llama.a/whatever) and making main.cpp a simple driver program. A simple C API should be exposed to access the model, and then bindings can more easily be written for Python, node.js, or whatever other language.
This would partially solve #82 and #162.
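
To make the idea concrete, here is a rough sketch of the shape such a C API could take: an opaque context handle plus open/eval/close functions, which is a common pattern for libraries meant to be wrapped from other languages. Every name below (llm_context, llm_open, llm_eval, llm_close) is made up for illustration and is not an existing llama.cpp interface. The implementation is only a stub that echoes the prompt instead of loading or running a model.

```c
#include <stdio.h>
#include <stdlib.h>
#include <string.h>

/* ---- Hypothetical public header (names are illustrative only) ---- */
#ifdef __cplusplus
extern "C" {
#endif

/* Opaque handle: callers never see the struct layout, so the library
 * internals can change without breaking bindings. */
typedef struct llm_context llm_context;

/* Returns NULL on failure. */
llm_context *llm_open(const char *model_path);

/* Writes a NUL-terminated result into out.
 * Returns the number of bytes written (excluding NUL), or -1 on error. */
int llm_eval(llm_context *ctx, const char *prompt, char *out, size_t out_size);

/* Safe to call with NULL. */
void llm_close(llm_context *ctx);

#ifdef __cplusplus
}
#endif

/* ---- Stub implementation (a real one would load weights and run inference) ---- */
struct llm_context {
    char *model_path; /* placeholder for real model state */
};

llm_context *llm_open(const char *model_path) {
    if (model_path == NULL) return NULL;
    llm_context *ctx = malloc(sizeof *ctx);
    if (ctx == NULL) return NULL;
    size_t n = strlen(model_path) + 1;
    ctx->model_path = malloc(n);
    if (ctx->model_path == NULL) {
        free(ctx);
        return NULL;
    }
    memcpy(ctx->model_path, model_path, n);
    return ctx;
}

int llm_eval(llm_context *ctx, const char *prompt, char *out, size_t out_size) {
    if (ctx == NULL || prompt == NULL || out == NULL || out_size == 0) return -1;
    /* Stub behavior: echo the prompt, tagged with the model path. */
    int n = snprintf(out, out_size, "[stub:%s] %s", ctx->model_path, prompt);
    if (n < 0 || (size_t)n >= out_size) return -1; /* error or truncated */
    return n;
}

void llm_close(llm_context *ctx) {
    if (ctx == NULL) return;
    free(ctx->model_path);
    free(ctx);
}
```

With something along these lines, main.cpp would shrink to a driver that parses arguments, calls the open function, feeds prompts to the eval function, and closes the handle. Because the surface is plain C with an opaque pointer and simple types, it could be loaded from Python via ctypes or wrapped in a node.js native addon without exposing any C++ types.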