dash-infer
CPU specifications for Qwen2-7B models
I would like to know the minimum hardware specifications required to run 7B models. With my current configuration, I can only run the 1.5B model, at an average throughput of 8.1 tokens/s.