llama-cpp-python
llama-cpp-python copied to clipboard
Feature: Lightweight llama_cpp.server Docker Image Build Workflow
By adding this workflow, the Docker image it builds becomes more lightweight.
You can see that the built image is one-tenth the size of the original image.
docker run --network host --rm -it -v ./llama-server/:/data/ ghcr.io/devcxl/llama-cpp-python:light-server --host 0.0.0.0 --model /data/models/Qwen1.5/qwen1_5-0_5b-chat-q2_k.gguf
By using this command, you can run it directly
@Smartappli Okay, I will make these changes later today
@abetlen Can you approve testing plz?