纪焘 (Tao Ji)

Results 1 repositories owned by 纪焘 (Tao Ji)

MHA2MLA

200
Stars
21
Forks
200
Watchers

Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMs