What's the throughput of R1 671B using bs=1 without quant?

Open • ghostplant opened this issue 9 months ago • 0 comments

On an H200, what's the throughput of R1 671B with bs=1 and no quantization?

ghostplant • Mar 17 '25 12:03