attention-sink topic

List attention-sink repositories

intel-extension-for-transformers

2.0k
Stars
189
Forks
Watchers

⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡