efficient-attention topic

cosformer-pytorch

43 Stars · 8 Forks

Unofficial PyTorch implementation of the paper "cosFormer: Rethinking Softmax In Attention".

Compact-Global-Descriptor

25 Stars · 7 Forks

PyTorch implementation of "Compact Global Descriptor for Neural Networks" (CGD).

CoLT5-attention

218 Stars · 12 Forks

Implementation of the conditionally routed attention in the CoLT5 architecture, in PyTorch.

Infini-Attention

58 Stars · 5 Forks

PyTorch implementation of Efficient Infinite Context Transformers with Infini-attention, plus a QwenMoE implementation, a training script, and 1M-context passkey retrieval.

ring-attention-pytorch

428 Stars · 25 Forks

Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in PyTorch.