deepseek topic

List deepseek repositories

inference

8.8k
Stars
767
Forks
8.8k
Watchers

Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-prem, or your laptop — all through one unified, production-ready...

Awesome-LLM-Inference

2.6k
Stars
175
Forks
Watchers

📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.

ms-swift

11.6k
Stars
1.1k
Forks
11.6k
Watchers

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Phi4,...

inferflow

235
Stars
24
Forks
Watchers

Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).

aicommit2

448
Stars
38
Forks
448
Watchers

A Reactive CLI that generates commit messages for Git and Jujutsu with Ollama, ChatGPT, Gemini, Claude, Mistral and other AI

deepseek-free-api

2.7k
Stars
797
Forks
2.7k
Watchers

🚀 DeepSeek-V3 & R1大模型逆向API【特长:良心厂商】(官方贼便宜,建议直接走官方),支持高速流式输出、多轮对话,联网搜索,R1深度思考,零配置部署,多路token支持,仅供测试,如需商用请前往官方开放平台。

rag-gpt

478
Stars
82
Forks
478
Watchers

RAG-GPT, leveraging LLM and RAG technology, learns from user-customized knowledge bases to provide contextually relevant answers for a wide range of queries, ensuring rapid and accurate information re...

bigcodebench

187
Stars
22
Forks
Watchers

BigCodeBench: Benchmarking Code Generation Towards AGI

cherry-studio

2.5k
Stars
135
Forks
Watchers

🍒 Cherry Studio is a desktop client that supports for multiple LLM providers

obsidian-tars

41
Stars
5
Forks
Watchers

Obsidian plugin that supports text generation based on tag suggestions, using services like Claude, OpenAI, Ollama, Kimi, Doubao, Qwen, Zhipu, DeepSeek, QianFan & more. 插件基于标签建议进行文本生成,...