hollama
hollama copied to clipboard
Show response tokens per second rate
To calculate how fast the response is generated in tokens per second (token/s), divide
eval_count
/eval_duration
.
https://github.com/ollama/ollama/blob/main/docs/api.md#response