CJ Pais

Results: 2 issues by CJ Pais

Ported the code from [llama.cpp PR 5896](https://github.com/ggerganov/llama.cpp/pull/5896). Should address [llama.cpp 5852](https://github.com/ggerganov/llama.cpp/issues/5852) and [llama.cpp 5863](https://github.com/ggerganov/llama.cpp/issues/5863). To fix, we set the number of tokens processed to its correct value in ingest_images where...

This PR adds whisper.cpp support to llamafile. This addresses #17 in part. Only the server binary has been ported in this PR. Most of the work to support this was...

llama.cpp
llamafile