Baichuan-7B icon indicating copy to clipboard operation
Baichuan-7B copied to clipboard

想问一下在A800上测试的吞吐量,换算到推理速度的话有多少tokens/s?

Open HJT9328 opened this issue 1 year ago • 0 comments

Required prerequisites

Questions

7B模型实现了A800 上单卡吞吐的情况下实现了 70tokens/s 比较怀疑,

Checklist

  • [X] I have provided all relevant and necessary information above.
  • [X] I have chosen a suitable title for this issue.

HJT9328 avatar Nov 21 '23 10:11 HJT9328