[email protected]
United States of America Next Generation AI algorithms and systems
Infini-AI-Lab
scalable and robust tree-based speculative decoding algorithm
[COLM 2024] TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding