mixture-of-depths topic

List mixture-of-depths repositories

infini-transformer

262
Stars
22
Forks
Watchers

PyTorch implementation of Infini-Transformer from "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" (https://arxiv.org/abs/2404.07143)