deepseek topic

List deepseek repositories

inference

4.9k
Stars
391
Forks
39
Watchers

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any...

Awesome-LLM-Inference

2.6k
Stars
175
Forks
Watchers

📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.

ms-swift

3.6k
Stars
310
Forks
12
Watchers

Use PEFT or Full-parameter to finetune 350+ LLMs or 90+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Visio...

inferflow

235
Stars
24
Forks
Watchers

Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).

aicommit2

119
Stars
9
Forks
Watchers

A Reactive CLI that generates git commit messages with Ollama, ChatGPT, Gemini, Claude, Mistral and other AI

deepseek-free-api

312
Stars
102
Forks
Watchers

🚀 DeepSeek-V2大模型逆向API白嫖测试【特长:GPT4平替】,支持高速流式输出、多轮对话,零配置部署,多路token支持。

rag-gpt

315
Stars
51
Forks
Watchers

RAG-GPT, leveraging LLM and RAG technology, learns from user-customized knowledge bases to provide contextually relevant answers for a wide range of queries, ensuring rapid and accurate information re...

bigcodebench

187
Stars
22
Forks
Watchers

BigCodeBench: Benchmarking Code Generation Towards AGI

cherry-studio

2.5k
Stars
135
Forks
18
Watchers

🍒 Cherry Studio is a desktop client that supports for multiple LLM providers

obsidian-tars

41
Stars
5
Forks
Watchers

Obsidian plugin that supports text generation based on tag suggestions, using services like Claude, OpenAI, Ollama, Kimi, Doubao, Qwen, Zhipu, DeepSeek, QianFan & more. 插件基于标签建议进行文本生成,...