qdurllm
Cease support for llama.cpp-served Gemma
See #2; the current llama.cpp-served Gemma setup is also inefficient.
Explore new local serving methods, such as quantization (not dockerizable) and the llama-cpp-python package (see the sketch below).
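As a starting point, here is a minimal sketch of serving a quantized Gemma model through the llama-cpp-python package; the model path, context size, and generation parameters are placeholder assumptions, not settings from this repo:

```python
# Minimal sketch: load a quantized (GGUF) Gemma model with llama-cpp-python
# and run a single chat completion. Model path and parameters below are
# hypothetical placeholders.
from llama_cpp import Llama

llm = Llama(
    model_path="models/gemma-2b-it.Q4_K_M.gguf",  # hypothetical local GGUF file
    n_ctx=2048,   # context window size
    n_threads=4,  # CPU threads to use for inference
)

response = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Summarize what Qdrant does."}],
    max_tokens=128,
)
print(response["choices"][0]["message"]["content"])
```

Since this runs in-process rather than against a separate llama.cpp server, it could simplify dockerization compared to the current setup, though that remains to be verified.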