Flex Wang

Results 5 comments of Flex Wang

> @a123775 @flexwang2 This has been resolved in the latest release. Thanks a lot bro!

We would like to see this chunkation on trition side feature will go live

@byshiue can I know why we don't use flash attention for encoder model as well?