llmperf-leaderboard
Throughput of llama2 70b higher than llama2 7b
I am trying to understand this result. I would expect llama2 70b to have lower throughput than llama2 7b.
Is the configuration different between the llama2 70b table and the llama2 7b table?