StrangeBytesDev
Results
3
comments of
StrangeBytesDev
I think this function in llama.cpp might be the right one to call to try to implement this. https://github.com/ggerganov/llama.cpp/blob/b2440/llama.cpp#L14010 But I've never done any kind of C++ to Nodejs bindings...
> * The main problem is that it holds the entire context state and not just the evaluation cache of the tokens used in a specific context sequence, so it...
You can do ```Typescript response: { 200: z.null().describe('No Content'), } ```