Baichuan-7B
Baichuan-7B copied to clipboard
[Question] 请问7B没有用上FlashAttention吗?
Required prerequisites
- [X] I have read the documentation https://github.com/baichuan-inc/baichuan-7B/blob/HEAD/README.md.
- [X] I have searched the Issue Tracker and Discussions that this hasn't already been reported. (+1 or comment there if it has.)
- [X] Consider asking first in a Discussion.
Questions
请问7B没有用上FlashAttention吗?看了下7B代码,没发现这块的逻辑。
Checklist
- [X] I have provided all relevant and necessary information above.
- [X] I have chosen a suitable title for this issue.
No. We user xformers for training, and naive impl for inference.