Alpaca-LoRA-Serve
Alpaca-LoRA-Serve copied to clipboard
ggml-alpaca-3b-q4 on CPU?
Is it possible to run ggml-alpaca-3b-4q.bin model on cpu ram? And specify filepath instead of url to hf?