llama.cpp
Use fprintf(stderr, ...) for diagnostic output; keep printf only for printing model output.
One can then use ./main ... 2>/dev/null
to suppress any diagnostic output.
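
A minimal sketch of the split described above, not the actual patch: the helper names log_diagnostic and print_output are hypothetical and do not appear in llama.cpp. It assumes diagnostics go to stderr and generated text goes to stdout, so a shell redirect of file descriptor 2 hides the former without touching the latter.

```cpp
#include <cstdio>
#include <string>

// Hypothetical helper: diagnostics (load progress, timings, warnings)
// are written to the given stream, which defaults to stderr.
void log_diagnostic(const std::string & msg, FILE * stream = stderr) {
    fprintf(stream, "%s\n", msg.c_str());
}

// Hypothetical helper: model output is written to the given stream,
// which defaults to stdout, and flushed so tokens appear immediately.
void print_output(const std::string & text, FILE * stream = stdout) {
    fprintf(stream, "%s", text.c_str());
    fflush(stream);
}
```

With this separation, redirecting stderr (2>/dev/null) leaves only what print_output wrote, which also makes the output safe to pipe into another program.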
Fixes https://github.com/ggerganov/llama.cpp/issues/5