Baichuan-7B
What does the throughput measured on the A800 translate to in inference speed, in tokens/s?
Required prerequisites
- [X] I have read the documentation https://github.com/baichuan-inc/baichuan-7B/blob/HEAD/README.md.
- [X] I have searched the Issue Tracker and Discussions that this hasn't already been reported. (+1 or comment there if it has.)
- [X] Consider asking first in a Discussion.
Questions
I am rather skeptical of the claim that the 7B model reaches 70 tokens/s in single-GPU throughput on an A800.
Checklist
- [X] I have provided all relevant and necessary information above.
- [X] I have chosen a suitable title for this issue.