llama-cpp-python Feature: Lightweight llama_cpp.server Docker Image Build Workflow

Feature: Lightweight llama_cpp.server Docker Image Build Workflow

Open devcxl opened this issue 10 months ago • 3 comments

By adding this workflow, the Docker image it builds becomes more lightweight. You can see that the built image is one-tenth the size of the original image.

Apr 05 '24 16:04 devcxl

docker run --network host --rm -it -v ./llama-server/:/data/ ghcr.io/devcxl/llama-cpp-python:light-server --host 0.0.0.0 --model /data/models/Qwen1.5/qwen1_5-0_5b-chat-q2_k.gguf By using this command, you can run it directly

Apr 05 '24 17:04 devcxl

@Smartappli Okay, I will make these changes later today

May 07 '24 06:05 devcxl

@abetlen Can you approve testing plz?

May 23 '24 16:05 Smartappli

llama-cpp-python llama-cpp-python copied to clipboard

Feature: Lightweight llama_cpp.server Docker Image Build Workflow

llama-cpp-python
llama-cpp-python copied to clipboard