Zhiyuan Li
Please try to squash merge :)
Your suggestion makes a lot of sense. Some of these changes were introduced by the editor. I'll try to first limit the changes to chunkrwkv6 and fix the test.
[checkrwkv6.tar.gz](https://github.com/user-attachments/files/16608127/checkrwkv6.tar.gz) Here is the code that compares CUDA with FLA.
Also, this pull request fixes https://github.com/sustcsonglin/flash-linear-attention/issues/29. The problem was introduced by bfloat16 when calculating dq and dk. By converting to float32 when necessary and using TF32 as much as possible, the precision issue is resolved.
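For reference, the general pattern looks roughly like this (a minimal plain-PyTorch sketch, not the actual Triton kernel code from the PR; `grad_matmul_fp32` is a hypothetical helper and the real change lives inside the chunked RWKV6 kernels):

```python
import torch

# Let float32 matmuls use TF32 tensor cores on Ampere+ GPUs: much faster than
# strict fp32 while far more accurate than accumulating directly in bfloat16.
torch.backends.cuda.matmul.allow_tf32 = True

def grad_matmul_fp32(a: torch.Tensor, b: torch.Tensor, out_dtype=torch.bfloat16):
    # Hypothetical helper: upcast bfloat16 operands to float32 before the
    # matmul so the reduction runs in higher precision, then cast back to the
    # original dtype for storage. dq/dk in the backward pass are computed
    # through matmuls of this kind.
    return (a.float() @ b.float()).to(out_dtype)
```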
> @uniartisan Hi, just make some reviews, could you have a check?

Hi, I can't see any comments. Could you tell me where I should look?
Could you give me a review like this? https://github.com/sustcsonglin/flash-linear-attention/pull/44/files/4a3e2bb1d699c7e41ead7adc2f2403fb3e79ceb6 I can't see your messages :(
> @uniartisan Can you see my updated comments between the lines?

Sorry, I don't know what's going on. I still cannot see your review comments. Maybe you can directly post...
@yzhangcs Hello, I hope this finds you well. I have synchronized all the latest changes from your project. Given your expertise and valuable insights, I was wondering if you could...
> Hi, can you authorize this branch to me so that I can make some updates

Of course! Sorry for my late reply. I will try it :)
You can try:

```python
from transformers import AutoModelForCausalLM
from transformers.modeling_utils import no_init_weights

with no_init_weights():
    # Skip the default weight initialization; pass your model name or path here.
    model = AutoModelForCausalLM.from_pretrained(...)
```

And I will bring Bo's init to RWKV7 in the next few days.