
Add: fsa/flash-llm.md

Open • AdamG012 opened this issue 1 year ago • 1 comment

Thank you very much

AdamG012, Oct 05 '23 11:10

Hello there. To give some detail on this blog: it covers work by @Summer-Summer at FSA-Lab and others at Alibaba Research. The source code can be found at https://github.com/AlibabaResearch/flash-llm and https://github.com/usyd-fsalab/flash-llm. The work is a library for large-scale LLM inference that focuses on GPU code optimisations for sparse matrices.

@osanseviero @sayakpaul Let us know if there is anything you need.

AdamG012, Oct 25 '23 23:10