fastLLaMa
Port llama.cpp OpenCL support to fastLLaMa?
llama.cpp somewhat recently added support for OpenCL acceleration, enabling hardware acceleration on AMD GPUs. Would it be possible to port the same support to fastLLaMa?