ZCHNO

Results: 2 issues by ZCHNO

Hi, I noticed that you use a KV cache with FlashAttention in CausalSelfAttention. As far as I know, FlashAttention already implements causal self-attention in its kernels, which means for...

**Describe the bug** Can't find libnccl.so when building from source. It seems flux only builds a static NCCL library instead of...