Flex Wang
Results
5
comments of
Flex Wang
> @a123775 @flexwang2 This has been resolved in the latest release. Thanks a lot bro!
We would like to see this chunkation on trition side feature will go live
Answer is yes
@byshiue can I know why we don't use flash attention for encoder model as well?