deepseek topics

inference

2.9k

Stars

239

Forks

Watchers

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any...

xorbitsai

artificial-intelligence

chatglm

chatglm2

deepseek

Awesome-LLM-Inference

1.5k

Stars

118

Forks

Watchers

📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.

awq