refact
refact copied to clipboard
add refact llama.cpp tutorial
/claim #77
reference PR: https://github.com/ggerganov/llama.cpp/pull/3329
@olegklimov please help review and feel free to test. The inference is extremely fast with the effort from llama.cpp.
@ds5t5
It is really fast! Nice work!