mlx-llm-server
Benchmarks?
Do you have some benchmarks against llama.cpp?
It will be slower than llama.cpp, since MLX is a general machine-learning framework rather than one specialized for LLM inference. That said, the MLX team is actively working on performance, and I expect inference speed to improve significantly over time.
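
If you want rough numbers on your own machine, here is a minimal sketch for measuring generation throughput. It assumes the server exposes an OpenAI-compatible `/v1/chat/completions` endpoint on `localhost:8080` and reports token usage in the response; both the URL and the model behavior are assumptions, so adjust them to your setup. Pointing the same script at a llama.cpp server gives a like-for-like comparison.

```python
import time
import requests  # pip install requests

# Hypothetical endpoint; change host/port to match your server.
URL = "http://localhost:8080/v1/chat/completions"
PAYLOAD = {
    "messages": [{"role": "user", "content": "Write a short essay about autumn."}],
    "max_tokens": 256,
    "stream": False,
}

start = time.time()
resp = requests.post(URL, json=PAYLOAD, timeout=300)
resp.raise_for_status()
elapsed = time.time() - start

# Assumes the server returns OpenAI-style usage accounting.
completion_tokens = resp.json()["usage"]["completion_tokens"]
print(f"{completion_tokens} tokens in {elapsed:.2f}s "
      f"-> {completion_tokens / elapsed:.1f} tok/s")
```

Note this measures end-to-end wall time, so it folds prompt processing into the number; for a fairer comparison, run several iterations and discard the first to exclude model load and warmup.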