qdurllm icon indicating copy to clipboard operation
qdurllm copied to clipboard

Cease support for llama.cpp-served Gemma

Open AstraBert opened this issue 1 year ago • 0 comments

Reference to #2 but also to the inefficiency of the solution

Explore new local serving methods like quantization (non-dockerizabble) and llama.cpp python package

AstraBert avatar Jan 03 '25 11:01 AstraBert