Alpaca-LoRA-Serve
Alpaca-LoRA-Serve copied to clipboard
Enable MPS inference for Apple silicon
Could MPS support be added to enable faster inference using Apple silicon? See https://github.com/tloen/alpaca-lora/pull/48 for an example implementation using the original 7B Alpaca-LoRA checkpoint.