mixture-of-depths topic
List
mixture-of-depths repositories
infini-transformer
262
Stars
22
Forks
Watchers
PyTorch implementation of Infini-Transformer from "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" (https://arxiv.org/abs/2404.07143)