Add Model Support for xLSTM

Open stefan-it opened this issue 2 years ago • 4 comments

Model description

Inspired by recent rumors about xLSTM - a hidden successor to LSTM - by Sepp Hochreiter, this issue tracks adding an open-source implementation of xLSTM to the Transformers library.

Open source status

  • [ ] The model implementation is available
  • [ ] The model weights are available

Provide useful links for the implementation

  • [x] Paper is available here

At the moment, no implementation exists.

There are only rumors that xLSTM surpasses GPT-2 on various (small) downstream datasets.

A good overview is the xLSTM Resources repository from @AI-Guru.

stefan-it avatar Oct 23 '23 10:10 stefan-it

Sounds like a money grab. If it is something useful, he should have chosen the academic path or at least filed a patent.

This way of boldly claiming success via non-serious media channels is highly unprofessional. It smells like publicity is more relevant than results which further supports motivations like funding/personal gains/politics.

Pythoniasm avatar Oct 23 '23 11:10 Pythoniasm

If I understood it correctly, a patent is on its way, and at least a paper about xLSTM will be published in less than 6 months.

DavidFarago avatar Jan 31 '24 20:01 DavidFarago

I have some doubts about whether this is planned as an open source model.

KnutJaegersberg avatar Feb 02 '24 10:02 KnutJaegersberg

Paper is published now: https://arxiv.org/abs/2405.04517

albertz avatar May 08 '24 07:05 albertz

Need code and checkpoint or it didn't happen.

Ghost---Shadow avatar May 30 '24 14:05 Ghost---Shadow

Official implementation is out now:

https://github.com/NX-AI/xlstm

stefan-it avatar Jun 04 '24 06:06 stefan-it

Note that the official source code is AGPL-licensed.

danthe1st avatar Jun 04 '24 08:06 danthe1st